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Preface 


This  volume  contains  some  of  the  lectures  presented  at  the  1994  AMS /SIAM 
Summer  Seminar,  held  June  20-July  1  at  the  Mathematical  Sciences  Research 
Institute  in  Berkeley.  It  was  the  intent  of  the  organizers  of  the  summer  seminar 
to  introduce  the  participants  to  as  many  of  the  interesting  and  active  applications 
of  dynamical  systems  to  problems  in  applied  mathematics  as  the  time  constraints 
of  the  workshop  allowed.  Consequently,  this  book  covers  a  great  deal  of  ground. 
Nonetheless,  the  pedagogical  orientation  of  the  lectures  has  been  retained  in  this 
volume,  and  as  such,  we  hope  that  it  will  serve  as  an  ideal  introduction  to  these 
varied  and  interesting  topics. 

While  the  focus  of  the  workshop  was  quite  broad,  several  organizing  prin¬ 
ciples  emerged.  The  first  was  the  increasing  role  of  dynamical  systems  theory 
in  our  understanding  of  partial  differential  equations.  The  first  three  contribu¬ 
tions  of  the  present  volume  are  devoted  to  this  theme.  In  particular,  all  of  these 
lecturers  stressed  the  importance  that  the  geometrical  structures  present  in  the 
phase  spaces  of  these  systems  have  for  our  understanding  of  their  dynamics,  A 
second  theme  was  the  central  importance  of  certain  prototypical  partial  differ¬ 
ential  equations.  These  equations,  which  include  the  complex  Ginzburg-Landau, 
nonlinear  Schrodinger  and  Korteweg-de  Vries  equations,  arise  in  many  different 
contexts  and  hence  have  an  importance  that  transcends  their  apparently  special 
form.  In  this  book,  two  sets  of  lectures  explore  this  phenomenon  in  greater  detail 
for  the  complex  Ginzburg-Landau  equation,  one  examining  in  detail  the  sorts  of 
phenomena  that  can  arise  in  this  equation,  and  the  other  focussed  on  showing  rig¬ 
orously  how  a  knowledge  of  the  behavior  of  the  solutions  of  the  Ginzburg-Landau 
equation  implies  information  about  a  host  of  more  complicated  systems.  In  addi¬ 
tion  to  their  ubiquity,  the  nonlinear  Schrodinger  and  Korteweg-de  Vries  equation 
share  the  additional  remarkable  property  of  being  completely  integrable.  The 
meaning  and  consequences  of  complete  integrability  are  explored  in  the  lectures 
of  section  2.  Finally,  the  last  set  of  lectures  looks  specifically  at  problems  in  fluid 
mechanics  and  turbulence.  More  specifically,  it  examines  the  extent  to  which 
one  can  determine  the  limits  of  popular  physically  motivated  heuristic  theories 
of  fluids  like  the  renormalization  group  and  the  Kolmogorov  scaling  law. 

Percy  Deift 
C.  David  Levermore 
C.  Eugene  Wayne 
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An  Introduction  to  KAM  Theory 

C.  EUGENE  WAYNE 


1.  Introduction 

Over  the  past  thirty  years,  the  Kolmogorov- Arnold-Moser  (KAM)  theory  has 
played  an  important  role  in  increasing  our  understanding  of  the  behavior  of  non- 
integrable  Hamiltonian  systems.  I  hope  to  illustrate  in  these  lectures  that  the 
central  ideas  of  the  theory  are,  in  fact,  quite  simple.  With  this  in  mind,  I  will 
concentrate  on  two  examples  and  will  forego  generality  for  concreteness  and  (I 
hope)  clarity.  The  results  and  methods  which  I  will  present  are  well-known  to 
experts  m  the  field  but  I  hope  that  by  collecting  and  presenting  them  in  as 
simple  a  context  as  possible  I  can  make  them  somewhat  more  approachable  to 
newcomers  than  they  are  often  considered  to  be. 

The  outline  of  the  lectures  is  as  follows.  After  a  short  historical  introduction, 
I  will  explain  in  detail  one  of  the  simplest  situations  where  the  KAM  techniques 
are  used  -  the  case  of  diffeomorphisms  of  a  circle.  I  will  then  go  on  to  discuss 
the  theory  in  its  original  context,  that  of  nearly-integrable  Hamiltonian  systems. 

The  problem  which  the  KAM  theory  was  developed  to  solve  first  arose  in  ce¬ 
lestial  mechanics.  More  than  300  years  ago,  Newton  wrote  down  the  differential 
equations  satisfied  by  a  system  of  massive  bodies  interacting  through  gravita¬ 
tional  forces.  If  there  are  only  two  bodies,  these  equations  can  be  explicitly 
solved  and  one  finds  that  the  bodies  revolve  on  Keplerian  ellipses  about  their 
center  of  mass.  If  one  considers  a  third  body  (the  “three-body-problem”),  no 
exact  solution  exists  -  even  if,  as  in  the  solar  system,  two  of  the  bodies  are 
much  lighter  then  the  third.  In  this  case,  however,  one  observes  that  the  mutual 
gravitational  force  between  these  two  “planets”  is  much  weaker  than  that  be¬ 
tween  either  planet  and  the  sun.  Under  these  circumstances  one  can  try  to  solve 
the  problem  perturbatively,  first  ignoring  the  interactions  between  the  planets. 
This  gives  an  integrable  system,  or  one  which  can  be  solved  explicitly,  with 
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each  planet  revolving  around  the  sun  oblivious  of  the  other’s  existence.  One 
can  then  try  to  systematically  include  the  interaction  between  the  planets  in  a 
perturbative  fashion.  Physicists  and  astronomers  used  this  method  extensively 
throughout  the  nineteenth  century,  developing  series  expansions  for  the  solutions 
of  these  equations  in  the  small  parameter  represented  by  the  ratio  of  the  mass 
of  the  planet  to  the  mass  of  the  sun.  However,  the  convergence  of  these  series 
was  never  established  -  not  even  when  the  King  of  Sweden  offered  a  very  sub¬ 
stantial  prize  to  anyone  who  succeeded  in  doing  so.  The  difficulty  in  establishing 
the  convergence  of  these  series  comes  from  the  fact  that  the  terms  in  the  series 
have  small  denominators  which  we  shall  consider  in  some  detail  later  in  these 
lectures.  One  can  obtain  some  physical  insight  into  the  origin  of  these  conver¬ 
gence  problems  in  the  following  way.  As  one  learns  in  an  elementary  course  in 
differential  equations,  a  harmonic  oscillator  has  a  certain  natural  frequency  at 
which  it  oscillates.  If  one  subjects  such  an  oscillator  to  an  external  force  of  the 
gamp  frequency  as  the  natural  frequency  of  the  oscillator,  one  has  resonance 
effects  and  the  motion  of  the  oscillator  becomes  unbounded.  Indeed,  if  one  has  a 
typical  nonlinear  oscillator,  then  whenever  the  perturbing  force  has  a  frequency 
that  is  a  rational  multiple  of  the  natural  frequency  of  the  oscillator,  one  will  have 
resonances,  because  the  nonlinearity  will  generate  oscillations  of  all  multiples  of 
the  basic  driving  frequency. 

In  a  similar  way,  one  planet  exerts  a  periodic  force  on  the  motion  of  a  second, 
and  if  the  orbital  periods  of  the  two  are  commensurate,  this  can  lead  to  resonance 
and  instability.  Even  if  the  two  periods  are  not  exactly  commensurate,  but  only 
approximately  so  the  effects  lead  to  convergence  problems  in  the  perturbation 
theory. 

It  was  not  until  1954  that  A.  N.  Kolmogorov  [8]  in  an  address  to  the  ICM  in 
Amsterdam  suggested  a  way  in  which  these  problems  could  be  overcome.  His 
suggestions  contained  two  ideas  which  are  central  to  all  applications  of  the  KAM 
techniques.  These  two  basic  ideas  are: 

•  Linearize  the  problem  about  an  approximate  solution  and  solve  the  lin¬ 
earized  problem  —  it  is  at  this  point  that  one  must  deal  with  the  small 
denominators. 

•  Inductively  improve  the  approximate  solution  by  using  the  solution  of 
the  linearized  problem  as  the  basis  of  a  Newton  s  method  argument. 

These  ideas  were  then  fleshed  out,  extended,  and  applied  in  numerous  other 
contexts  by  V.  Arnold  and  J.  Moser,  ([1],  [9])  over  the  next  ten  years  or  so, 
leading  to  what  we  now  know  as  the  KAM  theory. 

As  I  said  above,  we  will  consider  the  details  of  this  procedure  in  two  cases. 
The  first,  the  problem  of  showing  that  diffeomorphisms  of  a  circle  are  conju¬ 
gate  to  rotations,  was  chosen  for  its  simplicity  -  the  main  ideas  are  visible  with 
fewer  technical  difficulties  than  appear  in  other  applications.  We  will  then  look 
at  the  KAM  theory  in  its  original  setting  of  small  perturbations  of  integrable 
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Hamiltonian  systems.  I’ll  attempt  to  parallel  the  discussion  of  the  case  of  circle 
diffeomorphisms  as  closely  as  possible  in  order  to  keep  our  focus  on  the  main 
ideas  of  the  theory  and  ignore  as  much  as  possible  the  additional  technical  com¬ 
plications  which  arise  in  this  context. 

Acknowledgments:  It  is  a  pleasure  to  thank  Percy  Deift  and  Andrew  Torok 
for  many  helpful  comments  about  these  notes. 


6 


C.  EUGENE  WAYNE 


2.  Circle  DifFeomorphisms 

Let  us  begin  by  discussing  one  of  the  simplest  examples  in  which  one  encoun¬ 
ters  small  denominators,  and  for  which  the  KAM  theory  provides  a  solution. 
It  may  not  be  apparent  for  the  moment  what  this  problem  has  to  do  with  the 
problems  of  celestial  mechanics  discussed  in  the  introduction,  but  almost  all  of 
the  difficulties  encountered  in  that  problem  also  appear  in  this  context  but  m 
ways  which  are  less  obscured  by  technical  difficulties  -  this  is,  if  you  like,  our 
warm-up  exercise. 

We  will  consider  orientation  preserving  diffeomorphisms  of  the  circle,  or  equiv¬ 
alently,  their  lifts  to  the  real  line: 

0(x)  =  a;  +  fj{x)  with  fj{x  +  1)  =  fj{x)  and  f]'{x)  >  —1  . 

We  wish  to  consider  (/>  as  a  dynamical  system,  and  study  the  behavior  of  its 
“orbits”  —  i.e.  we  want  to  understand  the  behavior  of  the  sequences  of  points 
{(^('^l(a;)(modl)}^05  where  means  the  n-fold  composition  of  (j)  with  itself. 
Typical  questions  of  interest  are  whether  or  not  these  orbits  are  periodic,  or 
dense  in  the  circle. 

The  simplest  such  diffeomorphism  is  a  rotation  Ra{x)  =  x a.  Note  that 
we  understand  “everything”  about  its  dynamics.  For  instance,  if  a  is  rational, 
all  the  orbits  of  Ra  are  periodic,  and  none  are  dense.  However,  we  would  like 
to  study  more  complicated  dynamical  systems  than  this.  Thus  we  will  suppose 
that 

(1)  (t>{x)  =x-\-a^rf{x)  , 

where  as  before,  r}{x  +  1)  =  r}{x)  and  7]'{x)>  -1.  As  I  said  in  the  introduction, 
I  will  not  attempt  to  consider  the  most  general  case,  but  rather  will  focus  on 
simplicity  of  exposition.  Thus  I  will  consider  only  analytic  diffeomorphisms. 
Define  the  strips  =  {z  e  C  \  \Imz\  <  a}.  Then  I  will  assume  that 

T)  e  Ba  =1^  I  r]{z)  is  analytic  on  S^, 

T]{x  +  1)  =  T]{x)  and  sup  \7]{z)\  =  <  oo}  . 

\Imz\<o- 

Note  that  one  can  assume  that  a  <1,  without  loss  of  generality. 

Our  goal  in  this  section  will  be  to  understand  the  dynamics  of  (j>{x)  =  x  + 
a  +  f}(x)  when  t]  has  small  norm.  One  way  to  do  this  is  to  show  that  the 
dynamics  of  4>  are  “like”  the  dynamics  of  a  system  we  understand  -  for  instance, 
suppose  that  we  could  find  a  change  of  variables  which  transformed  ^  into  a  pure 
rotation.  Then  since  we  understand  the  dynamics  of  the  rotation,  we  would  also 
understand  those  of  <j).  If  we  express  this  change  of  variables  as  x  =  H{0,  where 
=  1  H{^)  preserves  the  periodicity  of  </>,  then  we  want  to  find  H  such 

that 


H-^o<j>oH{0  =  Rf,iO  - 
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or  equivalently 

(2)  =  . 

Such  a  change  of  variables  is  said  to  conjugate  4>  to  the  rotation  R^. 

Remark  2.1.  The  relationship  between  this  problem  and  the  celestial  mechan¬ 
ics  questions  discussed  in  the  introduction  now  becomes  more  clear.  In  that  case 
we  wanted  to  understand  the  extent  to  which  the  motion  of  the  solar  system  when 
we  included  the  effects  of  the  gravitational  interaction  between  the  various  planets 
was  “like”  that  of  the  simple  Kepler  system. 

In  order  to  answer  this  question  we  need  to  introduce  an  important  charac- 
teristic  of  circle  diffeomorphisms,  the  rotation  number 

Definition  2.1,  The  rotation  number  of  (p  is 

p(4>)  =  hm  . 

n-+oo  n 

Remark  2.2.  It  is  a  standard  result  of  dynamical  systems  theory  that  for  any 
homeomorphism  of  the  circle  the  limit  on  the  right  hand  side  of  this  equation 
exists  and  is  independent  of  x.  (See  [6],  p.  296.) 

Remark  2.3.  Note  that  from  the  definition  of  the  rotation  number,  it  follows 
immediately  that  for  any  homeomorphism  H,  the  map  ^  o  (p  o  H  has  the 

same  rotation  number  as  0.  (Since  =  if-i  o  o  H,  and  the  inUial  and 
final  factors  of  H  and  have  no  effect  on  the  limit.) 

As  a  final  remark  about  the  rotation  number  we  note  that  if  (p(x)  =  x-\- a -\- 
r}{x),  then  an  easy  induction  argument  shows  that  p{(p)  ~  a+limn^oo  ^ 

In  particular,  if  a  =  p,  we  have  lim„_oo  1  V  °  =  o! so  we 

have  proved: 

Lemma  2.1.  If  (p{x)  ~  x  -\-  p-\-T}{x)  has  rotation  number  p,  then  there  exists 
some  xq  such  that  r]{xo)  =  Q. 

We  must  next  ask  about  the  properties  we  wish  the  change  of  variables  H  to 
have.  If  we  only  demand  that  be  a  homeomorphism,  then  Denjoy’s  The¬ 
orem  ([6]  p.  301)  says  that  if  the  rotation  number  of  (p  is  irrational,  we  can 
always  find  an  H  which  conjugates  (P  to  a  rotation.  However,  if  we  want  more 
detailed  information  about  the  dynamics  it  makes  sense  to  ask  that  H  have  ad¬ 
ditional  smoothness.  In  fact,  it  is  natural  to  ask  that  H  he  as  smooth  as  the 
diffeomorphism  itself  -  in  this  case,  analytic.  (There  will,  in  general,  be  some 
loss  of  smoothness  even  in  this  case.  We  will  find,  for  example,  that  while  there 
exists  an  analytic  conjugacy  function,  H,  its  domain  of  analyticity  will  be  some¬ 
what  smaller  than  that  of  (p.)  Surprisingly,  the  techniques  which  Denjoy  used 
fail  completely  in  this  case,  and  the  answer  was  not  known  until  the  late  fifties 
when  Arnold  applied  KAM  techniques  to  answer  the  question  in  the  case  when 
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7J  is  small.  Even  more  surprisingly,  in  order  to  even  state  Arnold’s  theorem,  we 
have  to  discuss  a  little  number  theory. 

Any  irrational  number  can  be  approximated  arbitrarily  well  by  rational  num¬ 
bers,  and  in  fact,  Dirichlet’s  Theorem  even  gives  us  an  estimate  of  how  good 
this  approximation  is.  More  precisely,  it  says  that  given  any  irrational  number  p, 
there  exist  infinitely  many  pairs  of  integers  {m,n)  such  that  \p-  (m/n)l  <  1/n  . 
On  the  other  hand,  most  irrational  numbers  can’t  be  approximated  much  better 

than  this. 

Definition  2.2.  The  real  number  p  is  of  type  {K^u)  if  there  exist  positive 
numbers  K  and  v  such  that  \p  -  {m/n)\  >  A'ln|"‘',  for  all  pairs  of  integers 
(m,  n) . 

Proposition  2.1.  For  every  u  >  2j  almost  every  irrational  number  p  is  of 
type  {K,  u)  for  some  K  >  0. 

Proof:  The  proof  is  not  difficult,  but  would  take  us  a  bit  out  of  our  way.  The 
details  can  be  found  in  [3],  page  116,  for  example.  Note  also,  that  we  can  ^sume 
without  loss  of  generality  that  K  <  I,  since  if  p  is  of  type  (A,  v)  for  some  K  >  I, 
it  is  also  of  type  (1,  v). 

Theorem  2.1  (Arnold’s  Theorem  [1]).  Suppose  that  p  is  of  type  {K,u). 
There  exists  e{K,  u,a)>0  such  that  if  4>{x)  =x  +  p  +  v{x)  has  rotation  number 
p,  and  l|?}l|ff  <  e{K,v,a),  then  there  exists  an  analytic  and  invertible  change  of 
variables  H{x)  which  conjugates  (j)  to  Rp. 

As  mentioned  above,  Arnold’s  proof  of  this  theorem  used  the  KAM  theory. 
The  proof  can  be  broken  into  two  main  parts  -  an  analysis  of  a  linearized  equa¬ 
tion,  and  a  Newton’s  method  iteration  step.  These  same  two  steps  will  reappear 
in  tL  next  section  when  we  discuss  nearly  integrable  Hamiltonian  systems,  and 
they  are  characteristic  of  almost  all  appUcations  of  the  KAM  theory. 

Remark  2.4.  It  may  seem  that  by  assuming  that  the  diffeomorphism  is  of  the 
form  (fix)  =  x+p+ri{x),  where  p  is  the  rotation  number  of  <f,  we  are  considering 
a  less  general  situation  than  that  described  above  in  which  we  allowed  (f  to  have 
the  form  x  +  a  +  pix).  As  we  shall  see  below,  there  is  no  real  loss  of  generality 

in  this  restriction. 


Step  1:  Analysis  of  the  Linearized  equation 

Note  that  since  ||7j|U  is  small,  the  diffeomorphism  (j>  is  “close”  to  the  pme 
rotation  R^.  Thus,  we  might  hope  that  if  a  change  of  variables  H  which  satisfies 
(2)  exists  is  would  be  close  to  the  identity  i.e.  H{x)  =  a:  -I-  h{x),  where  is 
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“small”.  If  we  make  this  assumption  and  substitute  this  form  of  H  in  (2),  we 
find  that  h  should  satisfy  the  equation 

(^)  h{x  +  /?)  -  h{x)  =  r){x  +  h{x)) 

If  we  now  expand  both  sides  of  this  equation,  retaining  only  terms  of  first  order 
in  the  (presumably)  small  quantities  h  and  r;,  we  find: 

W  h[x  +  p)  -  h{x)  =  'q{x) 

Since  all  the  functions  in  this  equation  are  periodic,  and  the  equation  is  linear  in 
the  unknown  function  /i,  we  can  immediately  write  down  a  (formal)  solution  for 
the  coefficients  in  the  Fourier  series  of  h.  If  f}{n)  is  the  Fourier  coefficient  of 
T],  then  the  Fourier  coefficient  of  h  is 


(5) 


h{n)  = 


Vin) 

g27rinp  _  ^  ’ 


n^O  . 


In  just  a  moment,  we  will  address  whether  or  not  the  function 

n#0  njtO  ’■ 

makes  any  sense,  however,  we  first  note  that  even  if  (5)  defines  a  well-behaved 
function,  it  will  not  solve  (4)  but  rather: 


(®)  +  p)  -  H^)  =  »?(x)  -  [  rj{x)dx  =  7)[x)  -  ?y(0)  . 

Jo 

This  is  because  the  zeroth  Fourier  coefficient  of  h  drops  out  of  (4).  The  fact  that 
h  does  not  solve  (4)  will  complicate  the  estimates  below.  The  problems  with 
showing  that  (5)  converges  arise  due  to  the  presence  of  the  factors  of  - 

1  in  the  denominator  of  the  summands,  and  these  are  the  (in)famous  small 
denominators  which  plagued  celestial  mechanics  in  the  last  century  and  which 
the  KAM  theory  finally  overcame.  We  first  note  that  if  p  is  rational,  there  is  little 
hope  that  the  sum  defining  h  will  converge  since  the  denominators  in  this  sum 
will  vanish  for  the  infinitely  many  n  for  which  pn  =  m  for  some  meZ  Thus, 
we  can  only  hope  for  success  if  p  ^  Q.  If  p  is  irrational,  the  denominator  will 
still  be  large  whenever  np  «  m.  However,  by  assuming  that  p  is  of  type  (K,  i'), 
we  have  some  control  over  how  close  to  zero  the  denominator  can  become.  In 
fact,  the  following  lemma  immediately  allows  us  to  estimate  h{x). 

Lemma  2.2.  If  p  is  of  type  {K,v),  then 

|g2,rmp  _  ^  >  4;^|„|-(^-l)  if  ^  ^  Q  . 
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Proof:  Since  p  is  of  type  we  know  that  \pn-m\>  and  the 

lemma  follows  by  writing  -  1|  =  2|  sin(7r(pn  -  m))|,  and  then  using 

the  fact  that  |sin(7rx)|  >  2\x\,  for  |a:|  <  1/2. 


□ 


The  other  fact  which  we  must  use  to  estimate  h  is  the  fact  that  since  t]  & 
Cauchy’s  theorem  immediately  gives  an  estimate  on  its  Fourier  coefficients  of 
the  form  |f)(n)|  <  ■  Combining  this  remark  with  Lemma  2.2,  we  see 

that  if  \Imz\  <a  -S,  one  has 


\h{z)\  = 


2^  p2'Kinp  _  \  '  —  2L^  4X 
n^O  n^O 


g-27ro-ln|g27r|n|(cr-<5) 


< 


r{v) 


K{2nSy 


where  r(i/)  =  x’'~^e~^dx,  and  we  have  assumed  that  2'k{K  + 1)^  <  iirS  <  1. 

Thus  we  have  proven, 


Proposition  2.2.  If  p  is  of  type  {K,u),  and  r)  €  Ba,  then  h{x),  defined  by 
(5 )  is  an  element  of  B^-s  for  any  6  >  0,  and  if  47r5  <  1,  we  have  the  estimate: 


r(p) 


K{2-k6Y 


Remark  2.5.  Note  that  we  do  not  get  an  estimate  on  h  in  B^  we  lose  some 
analyticity.  This  is  why  we  can’t  use  an  ordinary  Implicit  Function  Theorem  to 
solve  (2).  Indeed,  if  we  were  to  attempt  to  proceed  with  an  ordinary  iterative 
method  to  solve  (2),  we  would  find  that  we  gradually  lost  all  of  the  analyticity 
of  our  approximate  solution.  This  is  where  the  second  “big  idea”  of  the  KAM 
theory  enters  the  picture,  namely: 

Step2:  Newton’s  Method  in  Banach  Space 

Recall  that  Newton’s  method  says  that  if  you  want  to  find  the  roots  of  some 
nonlinear  equation,  you  should  take  an  approximate  solution  and  then  use  a 
linear  approximation  to  the  function  whose  roots  you  seek  to  improve  your  ap¬ 
proximation  to  the  solution.  You  then  use  this  improved  approximation  as  your 
new  starting  point  and  iterate  this  procedure.  In  the  present  circumstance,  we 
regard  (/>  as  an  approximation  to  Rp  and  then  use  the  linear  approximation, 
H{x)  =  x  +  h{x),  to  the  conjugating  function  to  improve  that  approximation. 
Recall  that  if  h{x)  had  solved  (2)  exactly,  then  o<j)oH  =  Rp.  Our  hope  is 
that  if  we  use  the  Hix)  that  comes  from  solving  (6),  o(j)oH  will  be  closer 
to  Rp  than  4>  was  and  then  we  can  iterate  this  process. 
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The  first  thing  we  must  do  is  check  that  H  is  invertible.  Since  H{z)  =  z+h{z), 
H  will  be  invertible  with  analytic  inverse  on  any  domain  on  which  ||/i'||  <  1. 
Cauchy’s  theorem  and  Proposition  2.2  imply  that  \\h%_2e  <  so 

we  conclude  ^ 


Lemma  2.3.  //27rrM||i,||,  <  and  dirtf  <  1,  then  H{z)  has  an 

analytic  inverse  on  the  image  of  8^-26- 

Remark  2.6.  Note  that  if  we  combine  the  inequalities  in  Lemma  2,3  and 
Proposition  2.2,  we  find  <  6.  Thus,  if  z  e  S^^26,  H{z)  e  Fur¬ 

thermore,  H  maps  the  real  axis  into  itself,  and  the  images  of  the  lines  Imz  = 
±(cr  —  26)  lie  outside  the  strip  8^-36-  With  this  information  it  is  easy  to  show 
that  Range{H\s^_^^g)  D  8^-367  so  that  H~^[z)  is  defined  for  all  z  G  8^-36- 

In  addition  to  knowing  that  the  inverse  exists,  we  need  an  estimate  on  its 
properties  which  the  following  proposition  provides. 

Proposition  2.3.  If 

27rrM|ir/||,  <  ir(27r^)(-+i)  and  47r(5  <  1  , 
then  H^^(z)  —  z  ~  h{z)  +  g{z),  where 


Holier— 45  ^ 


27rr(z/)2 

ii:2(27r^)(2t/+i) 


Proof:  If  we  define  g(z)  by  g{z)  —  H  ^(z)  —  z  h{z),  then  we  see  that 

z  =  H-^o  H{z)  =  z  +  h{z)  ~  h{z  +  h{z))  +  g{z  +  h{z))  . 

Thus,  g{z-hh{z))  =  h{z-^h{z))-hiz)  =  h'{z+sh{z))h{z)ds.  Setting^  =  H{z), 
this  becomes 

5(0=  [  h'{H-\0  +  sh{H-\^)))h{H-\0)ds  . 

J  0 

In  a  fashion  similar  to  that  in  the  remark  just  above,  Range{H\s^_^^)  3 

so  if  ^  6  Sa-is,  H-^iO  e  Scr-zs,  and  hence  +  sh{H-^{i))  €  so 

Applying  the  estimates  on  h  and  h'  from  above  we  obtain 


□ 

Let  us  now  examine  the  transformed  diffeomorphism  ^{x)  =  o(f>o  H{x). 
Since  h  is  only  an  approximate  solution  of  (2),  0  will  not  be  exactly  a  rotation, 
but  since  h  did  solve  the  linearized  equation  (6),  we  hope  that  }  will  differ  from 
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a  rotation  only  by  terms  that  are  of  second  order  in  the  small  quantities  h  and 
r/.  Using  the  form  oi  given  by  Proposition  2,3,  we  find 

^(x)  =  o  (f>o  H{x)  =  H~^{x h{x) p-{~r}{x h{x))) 

=  a:  +  h{x)  +  p  +  7?(a;  +  h{x))  -h{x-\-p-\-  h{x)  +  r){x  +  h{x)))  + 

+  g{x  +  h{x)  +  p  +  7?(x  +  h{x))) 

=  a;  +  p  +  {h{x)  -  h{x  +  p)  +  r]{x)}  +  {r/(x  +  h{x))  -  r]{x)}  + 

+  {h{x  +  p)  -  H-  p  +  h{x)  +  r}{x  +  /i(x)))}  + 

+  g{x  +  h{x)  +  p  +  77(0;  +  h{x)))  . 

We  first  note  that  because  h  solves  (6),  the  first  expression  in  braces  in  the 
second  to  last  line  of  this  sequence  of  equalities  is  equal  to  f/o-  The  next  expression 
in  braces  equals  +  sh{x))h{x)ds.  If  27rr(z/)||77llcr  <  K{27i:6)  ,  47r^  < 

1,  and  X  G  S(j—4s^  we  can  bound  the  norm  of  this  expression  on  B(j—4s  by 
Similarly,  the  quantity  in  braces  in  the  next  to  last  line 

may  be  rewritten  as  fg  h'{x  +  p  +  s{h{x)  +  r}{x  +  h{x)))(h[x)  +  r{{x  +  h{x))ds. 
Once  again,  assuming  that  the  conditions  on  WgWa  S  described  above  hold, 
and  that  x  e  5^-45,  then  we  can  bound  the  norm  of  this  expression  on  Bcr-46 

^  NU}II’?IU  <  Ill'll-  ’ 

where  the  last  inequality  used  the  fact  that  2'ir6K  <  1.  Finally,  if  \Imx\  <  a  — 6<5, 
then  \Im{x  +  h{x)  +  p  +  7?(a:  +  h(x)))|  <  a  -  46,  (since  \\T]\\a  <  <5),  so  that  the 
last  term  in  this  expression  is  bounded  by  Proposition  2.3. 

Define  t]{x)  by  ^{x^  —  x  p  ^(^)*  By  Remark  2.3,  0  has  rotation  number 
p,  so  by  Lemma  2.1  there  exists  a;o  such  that  7]{xo)  =  0.  Combining  this  remark 
with  the  expression  for  0  just  above,  we  find 

f?(0)  =  -{g{xo  h{xo))  -  r}{xo)} 

—  {h[xQ  +  p)  —  h{xQ  +  p  +  h{xo)  +  T}{xo  +  h(xo)))} 

-  g{xo  +  h{xo)  +  p  +  77(xo  +  h{xo)))  . 

In  the  previous  paragraph  we  bounded  the  norm  of  each  of  the  expressions  on 
the  right  hand  side  of  this  equality,  so  we  conclude  that 

o  Tiu)  47r(T{i/))^  ,1  „2  , 

|77(0)|  <  27r||»?|U-^^2^^y+i  K‘^(2Tr6)^‘'+^  K'^{2n6)'^''+^  ' 

Combining  this  estimate  with  the  bounds  above,  we  have  proven, 

Proposition  2.4.  7/27rr(p)||7?||cr  <  K(2nS)^'''^^\  and  4nS  <  1,  then  ^{x)  = 
o4>oH{x)  =  x  +  p  +  vix),  where 

II -II  ^  167r(r(p))2  ,,  ,|2 

Wilh-es  <  ' 
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Remark  2.7.  The  important  thing  to  note  about  the  estimate  off]  is  that  in 
spite  of  the  mess,  it  is  second  order  in  the  small  quantity  \\T]\\cr  as  we  had  hoped. 
That  is,  there  exists  a  constant  C{K,6,u)  such  that  ||^|U-65  <  C{K,6,u)\\r)\\l. 
This  is  what  makes  the  Newton’s  method  argument  work. 

The  proof  of  Arnold’s  Theorem  is  completed  by  inductively  repeating  the 
above  procedure.  The  principal  point  which  we  must  check  is  that  we  don’t 
lose  all  of  our  domain  of  analyticity  as  we  go  through  the  argument  ^  note 
that  (j)  is  analytic  on  a  narrower  strip  than  was  our  original  diffeomorphism  0. 
The  essential  reason  that  there  is  a  nonvanishing  domain  of  analyticity  at  the 
completion  of  the  argument  is  that  the  amount  by  which  the  analyticity  strip 
shrinks  at  the  n  step  in  the  induction  will  be  proportional  to  the  amount  by 
which  our  diffeomorphism  differs  from  a  rotation  at  the  iterative  step,  and 
thanks  to  the  extremely  fast  convergence  of  Newton’s  method,  this  is  very  small. 

The  Inductive  Argument 

Let  (I)q{x)  =  (j){x),  be  our  original  diffeomorphism,  and  set  r]o{x)  =  r]{x). 
Define  (j)i{x)  =  o(j)QoHQ{x),  and  by  induction,  (l)n+i{x)  =  = 

x  +  r]n+\{x)  where  Hn{x)  =  x^-  hn{x),  and  solves 


hn{x  +  p)  ~  hn(x)  =  rin{x)  —  , 

Also  define  the  sequence  of  inductive  constants: 

•  ~  36(1 +n2)  5  ^  0  (Note  that  this  insures  that  AttSo  <  1.) 

•  CTO  =  a,  and  ^  6<5„,  if  n  >  0. 

•  eo  =  and  Cn  =  if  n  >  0. 

Note  that  cr*  =  lim„_,oo  (Tn  >  cr/2.  We  now  have: 

Lemma  2.4  (Inductive  Lemma).  If 


then  0n+i(a;)  -  x  +  p  +  rjn+i{x),  with  e  and  \\Vn+i\\<Tn+i  <  ^n+i. 

FuHhermore,  Hn{x)  =  x  +  hn{x)  satisfies 

llh  II  A  < 

II  n||a„  6n  <  X{27r6ny  ’ 

while  H-^{x)  =  a;  -  hn{x)  +  gr,{x),  where 


llo  II 

\\9n\\a  <  ^2(27r<5^)2i/+l  * 

Proof:  Note  that  Proposition  2.2  and  Proposition  2.3  imply  that  the  estimates 
on  hn  and  hold  for  n  =  0.  The  estimate  on  l|r?i||cri  follows  by  noting  that 
from  Proposition  2.4, 


ll^lllcTi  ^ 


167r(r(i/))^ 

A2(27r(5)2^+i 
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and  the  hypothesis  on  the  inductive  constants  in  the  Inductive  Lemma  was  chosen 
so  that  this  last  expression  is  less  than  =  ei.  This  completes  the  first 

induction  step. 

Now  suppose  that  the  induction  holds  for  n  =  0, 1, . . . ,  iV  - 1,  so  that  we  know 
that  ||77iv|Uiv  <  To  prove  that  it  holds  for  n  =  N,  first  choose  Hn  to  solve 
hNix  p)  ~  hN{x)  =  7)n{x)  -  f)iv(O).  By  Proposition  2.2,  and  the  inductive 
estimate  on  rjN^  we  will  have  l|hivl|crjv-^jv  —  while  Proposition  2.3 

implies  that  =  x  -  hiv(rr)  -\r  gN{x),  with  ^ 

Finally,  if  we  define  o  o  Hn  —  x-]-  p  +  t/at+i,  with  r/jv+i  defined 

in  analogy  with  fj  in  Proposition  2.4,  then  we  see  that 

II  II  ^  l67rT{uy  , 

\\m+i\\aN+i  <  K‘^[27r6Ny^-^^  ^  ‘ 

Once  again,  if  use  the  hypotheses  on  the  inductive  constants  we  see  that  this 
expression  is  bounded  by  which  completes  the  proof  of  the  lemma. 

□ 

With  the  aid  of  the  Inductive  Lemma  it  is  easy  to  complete  the  proof  of 
Arnold’s  Theorem.  Define 

7iN{^)  =  HooHio---o  Hn{x) 

=  x  +  hN{x)  +  hjsf-iix  +  hn{x)  +  /iiv_2(a;  +  hj^{x)  +  hN-i{x  +  hN{x))) 
+ - hho(x  +  hi(a; + 

By  the  Induction  Lemma,  Hat  is  analytic  on  ^Ar(^)  —  2^  is  bounded 

by  ^  converges  as  a  consequence  of  the  hypotheses 

on  the  induction  constants  in  the  Induction  Lemma.)  Furthermore, 

Hn+i{z) -'Hn{z)  =Hn  ^  H]s[{z) —  1-Cn{z)  =  [  Ti'jsfiz -\-th]s[{z))hN{z)ds  , 

Jo 

so  WHn+i  -  WivlU„+:  ^  (If  +  definition  of  the 

inductive  constants,  the  right  hand  side  of  this  inequality  converges  if  summed 
over  N.  Thus,  Hn  converges  uniformly  to  some  limit  H  on  Sa*,  and  H  is 
analytic.  Furthermore,  n{z)  =  z-^h{z),  where  the  estimates  on  Hn{z)-z,  plus 
Cauchy’s  Theorem  imply  that  if  6*  =  cr*/16,  then  ||/i'||cr*-<5*  <  A/6*  <  6*,  again 
using  the  definition  of  the  inductive  constants.  By  an  argument  similar  to  that 
following  Lemma  2.3,  we  see  that  H  is  invertible  on  the  image  of  Sa*-6*  and 
that  this  image  contains  Sa*-26*- 

Finally,  note  that  <f)oHN{^)  =  Hn^<I>n{^)  =  HN{z-\-p-\-r}N{z))’  As  A'  — >  00, 
we  see  that 

SoHiz)—  lim  Hn^<I>n{z)=  bm  Hn{^  +  P-\-Vn{^))  =  Ho  Rp{z)  , 

N-^oo  N^oo 
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for  all  2;  G  Sa*~26*-  (The  last  equality  used  the  inductive  estimate  on  Since 
H  is  invertible  on  this  domain,  this  implies  o(j)oH  =  Rp,  so  7i  is  the  dif- 
feomorphism  whose  existence  was  asserted  in  Arnold’s  Theorem. 


□ 

Remark  2.8.  Suppose  that  in  Arnold^s  Theorem  we  were  given  a  diffeomor- 
phism  of  the  (apparently)  more  general  form 

'0(a:)  =  X  +  a  +  pi{x)  . 

but  still  with  rotation  number  p  of  type  (where  a  ^  p.)  We  can  rewrite 

i){x)  =  x-\-p->r{a-p+n{x))  =  x+/9+77(x).  IfiriW^  =  \\{a-p)+p\\„  <  e{K,u,a), 
then  Theorem  2.1  implies  that  'ip  is  analytically  conjugate  to  Rp 

Remark  2.9.  In  examples  it  may  be  difficult  to  determine  from  inspection  of 
the  initial  diffeomorphism  what  the  rotation  number  is.  In  such  cases  there  is 
often  a  parameter  in  the  problem  which  can  be  varied  and  which  allows  one  to 
show  that  the  conjugacy  in  Amold^s  Theorem  exists  at  least  for  most  parameter 
values.  For  instance^  the  following  result  can  be  proven  by  easy  modifications  of 
the  previous  methods:  (See,  [1],  page  271.) 

Theorem  2.2.  Consider  the  family  of  diffeomophisms: 
i'^)  (paA^)  ~xAa-\-  eri{x)  , 

for  a  e  [0, 1].  For  every  S  >  0,  there  exists  eo  >  0  such  that  if  |e|  <  eo,  there 
exists  a  set  A{e)  C  [0, 1]  such  that  for  a  E  A{€),  is  analytically  conjugate  to 
a  rotation  of  the  circle,  and  \Lebesgue  measure  (A(e))  -  1|  <  ^. 

Remark  2.10.  It  is  not  necessary  to  work  with  analytic  functions.  For  in¬ 
stance,  Moser  [10]  showed  that  if  the  original  diffeomorphism  is  C^,  and  if  the 
rotation  number  is  of  type  {K,v),  then  if  k  is  sufficiently  large  (depending  on 
v),  and  the  diffeomorphism  is  a  sufficiently  small  perturbation  of  a  rotation,  the 
diffeomorphism  is  conjugate  to  a  rotation,  with  a  C^'  conjugacy  function,  for 
some  1  <  k^  <  k.  Note  that  this  theorem  is  still  “local”  in  that  it  demands 
that  the  diffeomorphism  which  we  start  with  be  “close”  to  a  pure  rotation.  More 
recent  work  of  Hermann  [7]  and  Yoccoz  [12],  has  lead  to  a  remarkably  complete 
understanding  of  the  global  picture  of  the  dynamics  of  circle  diffeomorphisms. 
For  instance  (see  [12]),  one  can  write  the  real  numbers  as  a  union  of  two  disjoint 
sets  A  and  B,  and  prove  that  any  analytic  circle  diffeomorphism,  (p,  with  rota¬ 
tion  number  p[(j))  E  B  is  analytically  conjugate  to  the  rotation  Rp{<p),  while  for 
any  a  E  A,  there  exists  an  analytic  circle  diffeomorphism  with  rotation  number 
a  which  is  not  analytically  conjugate  to  R^. 
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3.  Nearly  Integrable  Hamiltonian  Systems 

In  this  section  we  address  the  KAM  theory  in  its  original  setting,  namely 
nearly  integrable  Hamiltonian  systems.  Recall  that  a  Hamiltonian  system  (in 
Euclidean  space)  is  a  system  of  2N  differential  equations  whose  form  is  given  by 


^  dqj 


;  j  =  1,...,N  , 


for  some  function  i? (p, 5) .  (Here p  —  (pi, . . .  ,PAr)  3'iid  ?  (91)---)9jv)  ) 

In  general  these  equations  axe  just  as  hard  to  solve  as  any  other  system  of  2N 
coupled,  nonlinear,  ordinary  differential  equations,  but  in  special  circumstances 
(the  integrable  Hamiltonian  systems)  there  exists  a  special  set  of  variables 
known  as  the  action-angle  variables,  (/,  (A),  I  £  and  4>  ^  >  ®uch  that  in 

terms  of  these  variables,  H{1, 4>)  =  h{I).  Since  the  Hamiltonian  does  not  depend 
on  the  angle  variables  0,  the  equations  of  motion  are  very  simple  -  they  become 


ij 

(j>j 


We  can  solve  these  equations  immediately,  and  we  find  that  I{i)  =  /(O),  and 
4,{t)  =  u!{I)t  +  <A(0).  Thus,  for  an  integrable  system  with  bounded  trajectories, 
the  action  variables  I  are  constants  of  the  motion,  while  the  angle  variables  <j) 
just  process  around  an  W-dimensional  torus  with  angular  frequencies  io  given  by 
the  gradient  of  the  Hamiltonian  with  respect  to  the  actions.  (In  particular,  ff 
the  components  of  w(/)  are  irrationally  related  to  one  another,  4>{t)  is  a  quasi- 
periodic  function.) 


Remark  3.1.  The  three-body  (or  N-body)  problem,  in  which  we  ignore  the 
mutual  interaction  between  the  planets  is  an  integrable  system. 


Now  suppose  that  we  start  with  an  integrable  Hamiltonian  h{I)  and  make  a 
small  perturbation  which  depends  on  both  the  action  and  the  angle  variables  — 
as  in  the  case  of  the  solar  system  when  we  consider  the  gravitational  interactions 
between  the  planets.  Then  the  Hamiltonian  takes  the  form. 


(8)  H [I,  <f>)  =  h{I)  +  f  {!,<(>)  ■ 

As  before  we  will  assume  that  the  Hamiltonian  function  is  analytic  in  order 
to  avoid  complications.  More  precisely,  if  we  think  of  /(/,  <A)  3®  ®  function  on 
R^xR'^,  which  is  periodic  in  (j>,  then  v/e  assume  that  there  exists  some  I*  eR 
such  that  H  can  be  extended  to  an  analytic  function  on  the  set  Acr,p{I*)  = 
{(/, ,A)  e  X  I  |/  -  /*|  <  P  ,  \Im{4>j)\  <  <7  ,  i  =  1, . . . ,  AT  }.  (I  will  always 

use  the  norm  for  W-vectors  -i.e.  \I\  =  Ef=i  We  define  the  norm  of  a 
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function  /,  analytic  on  A.,p{r)  by  ||/|U.p  =  \f{I,4>)\.  (As  in 

the  previous  section,  one  can  assume  without  loss  of  generality  that  a  <  1.) 

In  addition,  since  we  are  interested  in  nearly  integrable  Hamiltonian  systems, 
we  will  assume  that  /  has  small  norm,  (Just  as  we  assumed  that  r}  was  small 
in  the  previous  section.)  Furthermore,  we  can  assume  that  /(/,  (f))d^<f)  =  0, 
since  if  this  were  nonzero  it  could  be  absorbed  by  redefining  h{I). 

Question:  Do  the  trajectories  of  this  perturbed  Hamiltonian  system  still  lie  on 
invariant  tori,  at  least  for  f  sufhciently  small? 

To  state  the  answer  of  this  question  more  precisely,  we  need  an  analogue  of 
the  numbers  of  type  (K,  p)  introduced  in  the  previous  section. 

Definition  3.1.  We  say  that  a  vector  u  e  is  of  type  (^,7)  if 

N 

|(a;,n)|  =  I  ,  for  all  n  e  Z^\0  . 

.7=1 

Remark  3.2.  Note  that  if  p  is  of  type  {K,p),  then  the  vector  (p, -1)  is  of 
type  (T,  7)  with  K  =  L  and  7  =  p  —  1.  Also,  we  again  assume  without  loss  of 
generality  that  L  <1. 

Given  this  remark,  and  the  fact  that  we  know  that  the  numbers  of  type  {K,  p) 
are  a  subset  of  the  real  line  of  full  Lebesgue  measure,  the  following  result  (whose 
proof  we  omit)  is  not  surprising. 

Proposition  3.1.  Ifj  >  N',  almost  every  uj  e  is  of  type  (^,7)  for  some 
L<1, 

We  are  now  in  a  position  to  state  the  KAM  theorem. 

Theorem  3.1  (KAM).  Suppose  that  uj{r)  =  uj*  is  of  type  (L,7),  and  that 
the  Hessian  matrix  is  invertible  at  /*.  (And  hence  on  some  neighborhood  of 
I  .)  Then  there  exists  cq  >  0  such  that  if  ||/||cr^p  <  eo,  the  Hamiltonian  system 
(^)  has  a  quasi-periodic  solution  with  frequencies  a;(/*). 

Remark  3.3.  Although  we  have  claimed  in  the  theorem  only  that  at  least  one 
quasi-periodic  solution  exists  in  the  peHurbed  hamiltonian  system,  we  will  see  in 
the  course  of  the  proof  that  the  whole  torus,  1  =  1*,  survives. 

Remark  3.4.  One  might  wonder  why  we  study  quasi-periodic  orbits  rather 
than  the  apparently  simpler  periodic  orbits.  If  one  considers  values  of  the  ac¬ 
tion  variables  for  which  the  frequencies  ljj  (/)  are  all  rationally  related,  then  the 
integrable  hamiltonian  will  have  an  invariant  torus,  filled  with  pemodic  orbits. 
However,  under  a  typical  perturbation,  all  but  finitely  many  of  these  periodic 
orbits  will  disappear.  Hence,  the  quasi-periodic  orbits  are,  in  this  sense,  more 
stable  than  the  periodic  ones. 
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As  we  will  see,  the  proof  follows  very  closely  the  outline  of  the  previous  section. 
In  particular,  we  begin  with: 

Step  1:  Analysis  of  the  Linearized  equation 

The  basic  idea  is  to  find  new  variables  (/,  0)  such  that  in  terms  of  these  new 
variables  (8)  will  be  integrable.  However,  not  just  any  change  of  variables  is 
allowed,  because  most  changes  of  variables  will  not  preserve  the  Hamiltonian 
form  of  the  equations  of  motion.  We  will  admit  only  those  changes  of  variables 
which  do  preserve  the  Hamiltonian  form  of  the  equations.  Such  transformations 
are  known  as  canonical  changes  of  variables.  There  is  a  large  literature  on 
canonical  transformations,  (for  a  nice  introduction  see  [2]),  but  pursuing  it  would 
take  us  too  far  afield.  In  order  to  come  to  the  point  in  as  expeditious  a  fashion 
as  possible,  let  us  just  note  the  following: 

Proposition  3.2.  Suppose  that  there  exists  a  smooth  function  !](/,  (^)  such 
that  the  equations: 

,  ds  -  as 

~  d<l)  ’  di  ’ 

can  he  inveHed  to  find  {I,<t>)  =  ^{1,4’)-  Then  ^  is  a  canonical  transformation, 
and  S  is  called  its  generating  function. 

Proof:  See  [2],  section  48. 

Remark  3.5.  Note  that  T.(i,4>)  =  {i,(p)  is  the  generating  function  for  the 
identity  transformation.  (Here,  (•,  ■>  is  the  inner  product  in  .) 

Remark  3.6.  There  are  other  ways  of  generating  canonical  transformations. 
In  particular,  the  Lie  transform  method  has  proven  to  be  very  convenient  for 
computational  purposes  [5] .  However,  the  generating  function  method  offers  a 
simple  and  direct  way  to  prove  the  RAM  theorem  and  for  that  reason  I  have 
chosen  it  here. 

We  would  like  to  find  a  canonical  transformation  {I,4>)  =  ^{1,4)  such  that 
F(i,^)  =  Ro$(7,^)  =  M/),or 

(9)  H{^{i,4>),<i>)  =  m  ■ 

(This,  by  the  way,  is  the  Hamilton- Jacobi  equation.  In  the  last  century,  Jacobi 
proved  the  integrability  of  a  number  of  physical  systems  by  finding  solutions  of 
this  equation.)  In  our  example,  (9)  can  be  written  as: 

(10)  +/((§(t0),'^)  =  m  ■ 

Since  H  is  “close”  to  an  integrable  Hamiltonian  for  /  small,  we  can  hope  that 
the  canonical  transformation  is  “close”  to  the  identity  transformation.  Using 
the  fact  that  we  know  the  generating  function  for  the  identity  transformation, 
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we  will  look  for  canonical  transformations  whose  generating  functions  are  of  the 
form  S(/,  0)  =  (/,  ^)  +  5(7,  </>),  where  S  is  0{\\f\\a,p),  the  amount  by  which  our 
Hamiltonian  differs  from  an  integrable  one.  If  we  substitute  this  form  for  E  into 

(10)  and  expand,  retaining  only  terms  that  are  formally  of  first  order  in  the  small 
quantities  ||/||(t,p  and  ||*S'||o-,p,  we  obtain  the  linearized  Hamilton- Jacobi  equation: 

(11)  (uii),  —  (7, 0))  +  /(7, 0)  =  h{i)  -  h{i) . 

Once  again,  we  now  have  a  linear  equation  involving  periodic  functions,  so  if  we 
expand  /(7,  (/>)  =  /(7,n)e^^""^^’^>,  we  can  solve  (11)  and  we  find 


Remark  3.7.  Once  again,  as  in  (^),  the  function  S  defined  (formally)  by 

(12)  does  not  satisfy  (11),  but  rather 

(13)  = 

and  we  will  be  forced  to  estimate  the  difference  between  these  two  equations  below. 


Note  that  once  again,  we  will  face  small  denominators.  Indeed,  for  a  dense 
set  of  points  7,  the  denominators  in  (12)  will  vanish  for  infinitely  many  choices 
of  n.  This  is  the  reason  that  many  people  (including  Poincare)  at  the  end  of 
the  last  century  believed  that  these  series  diverged.  Nonetheless,  the  results 
of  Kolmogorov,  Arnold  and  Moser  show  that  “most”  (in  the  sense  of  Lebesgue 
measure)  points  7  give  rise  to  a  convergent  series.  Having  S  be  defined  only  on 
the  complement  of  a  dense  set  of  points  7  would  be  a  problem,  since  we  would  be 
hard  pressed  to  take  the  derivatives  we  need  in  order  to  compute  the  canonical 
transformation  in  Proposition  3.2.  To  proceed,  we  take  advantage  of  the  fact 
that  because  of  the  analyticity  of  /,  the  Fourier  coefficients  / (7,  n)  are  decaying 
to  zero  exponentially  fast  as  \n\  becomes  large.  Thus,  if  we  truncate  the  sum 
defining  S  to  consider  only  \n\  <  M,  for  some  large  M  we  will  make  only  a 
relatively  small  error  in  the  solution  of  (11).  On  the  other  hand,  since  there  are 
now  only  finitely  many  terms  in  the  sum  defining  S,  we  can  find  open  sets  of 
act  ion- variables  on  which  the  generating  function  is  defined.  Before  stating  the 
precise  estimate  on  S,  we  introduce  a  few  preliminaries. 

First,  define  O  >  1,  such  that 


max 


( (  sup 

V  \i-’'\<p 


d^h 

W 


ll)>(  sup 


<  Q  . 


(Here,  ||  •  ||  is  the  norm  of  the  matrix  considered  as  an  operator  from 
with  the  norm.)  Analogously,  define  fl  such  that  sup|/_;^.  \<p  ll(tF)ll  < 
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(In  this  case,  ||  •  ||  is  the  norm  of  (|^)  considered  as  a  bilinear  operator  from 
qN  ^  (jN  ^  ,)  Next  note  that  if  we  define 


will  no  longer  be  a  solution  of  (13),  but  rather  will  solve 


where  /^(/,  0)  =  X]jn|<M  Note  that  we  have  already  discarded 

all  terms  that  were  formally  of  more  than  first  order  in  ||/||cr,p  in  order  to  derive 
(11).  Thus,  if  in  deriving  this  equation  for  we  change  (11)  only  by  amounts 
of  this  order,  we  won’t  have  qualitatively  worsened  our  approximation.  We  will 
choose  M  in  order  to  insure  that  this  is  the  case. 


Proposition  3.3.  Choose  0  <  6  <  a,  and  set  M  =  1  log(||/||o-,p)|/(7r<5)-  U 
p  <  L/(20M^'^^)  and  inS  <  1,  then  is  analytic  on  Aa-6,p{I*),  o,nd 


/8r(7  +  l)\^(2iV^)  ll/Ik, 

V  (2'K6)'y+'^  )  2tvL 


Proof:  Recall  that  we  chose  our  domain  A(j,p{I*)  so  that  it  was  centered  (in 
the  I  variables)  at  a  point  with  oj{I*)  ==  a;*.  Now  suppose  that  we  choose 
|n|  <  M,  and  consider  (u;(7),n)  for  some  other  point  I  in  our  domain.  Writing 
/  =  /*  +  (/-  /*),  we  see  that  \{oj{I),n)  -  (u;(7*),n)|  <  n\n\p.  If  we  then  use 
the  fact  that  u*  is  of  type  (7/,  7),  we  find  that  for  |n|  <  M  and  all  |7  -  7*|  <  p, 

|(a;(/),n}|  =  |K>n)  +  ((a;(/),n)  -  (a;*,n))|  >  ^-n\n\p>  ^  , 

where  the  last  inequality  used  the  hypothesis  on  p  and  the  fact  that  |n|  <  M. 
If  we  combine  this  observation  with  the  fact  that  |/(7,  n)|  <  by 

Cauchy’s  theorem,  we  find 
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In  going  from  the  first  to  second  line  of  this  inequality,  we  used  the  fact  that 


|„|7e-2.«|n|  <  iV7(niax|n,|)^e-2’^*l"l  <  JJmax(l,  |n,|)e-2-«l"’-l  , 


so  that  E|„|<m  |nre-2-«l"l  <  Ar^(l  +  2Eilf=i 


Now  that  we  know  that  the  generating  function  is  well-defined,  we  can  proceed 
to  check  that  the  canonical  transformation  is  defined  and  analytic,  just  as  we 
did  in  Proposition  2.3  in  the  previous  section. 

Proposition  3.4.  If 

/8r(7+l)\^16jVT'+i||/|U,p 
V  (27r<5)T'+i  J  2w6pL  ’ 

p  <  and  47rS  <  1,  then  the  equations 

(14)  I^i+^  ,  and4>  =  4>+^  , 

define  an  analytic  and  invertible  canonical  transformation  (/,  0)  =  $(/,  ^)  on 
the  set 

Proof:  Just  as  in  the  proof  of  Lemma  2.3  we  begin  by  using  the  analytic  inverse 
function  theorem  to  check  that  (14)  can  be  inverted.  In  both  of  the  expres¬ 
sions  in  this  equation,  the  inverse  function  theorem  can  be  applied  provided 
II  \\a-26,p/2  <  1-  This  in  turn,  follows  immediately  from  the  estimate  in 
Proposition  3.3  and  Cauchy’s  Theorem. 

The  remainder  of  the  proposition  follows  if  we  check  that  the  transformation 
is  onto  the  domain  Aa-36,p/4-  (This  is  analogous  to  the  proof  of  Proposition 
2.3.)  Note  that  if  (/,  0)  G  Aa-26,pf2i 


II  II<t-2«,p/2  _  ^  (27r(5)'^+l  J 

by  the  hypothesis  of  the  Proposition,  while 


<  /8r(7  +  l)\^2iV^+^||/|U,^ 


<p/8  , 


Ilcr-5,p/4 


8r(7  +  l)\^8iV^+i||/|U, 
(27r5)T'+i )  2npL 


^  <  5/2  , 


again  by  the  hypotheses  of  the  Proposition.  Thus,  |7  -  /|  <  p/8,  while  |0  -  0|  < 
8/2.  This  implies  that  the  canonical  transformation  maps  the  set  Aa-26,p/2  onto 
Aa~3b,pi4t  and  hence  that  (/,(/>)  =  $(/,0)  on  this  set. 


□ 
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Remark  3.8.  For  a  vector  valued  function  like  on  a  domain  Aa,pj 

\\^\\a,p  -  1^1,  where  we  recall  that  |^|  is  the  f  norm  of 

This  is  the  origin  of  the  extra  factor  of  N  in  these  estimates. 


Step  2:  The  Newton  Step 

Now,  just  as  we  did  in  the  case  of  circle  diffeomorphisms,  where  we  trans¬ 
formed  our  original  diffeomorphism  with  the  approximate  conjugacy  function 
obtained  by  solving  the  linearized  conjugacy  equation,  we  transform  our  original 
Hamiltonian  with  the  approximate  canonical  transformation,  whose  generating 
function  is  and  show  that  the  difference  between  the  transformed  Hamil¬ 
tonian  and  an  integrable  Hamiltonian  is  of  second  order  in  the  small  quantity 
\\f\\a,p-  As  before,  we  will  use  this  fact  as  the  basis  for  a  Newton’s  method 
argument. 


Proposition  3.5.  Define  =  H  o  $(/,  =  h{I)  +  /(/,  0).  If 


/8r(7  +  l)>|^  16iV^+^ll/IU.P  ^ 

\  (27r(5)^+^  /  27r6pL 


p  <  and  Airh  <  1,  then  H  is  analytic  on  Aa~36,p/4.f 

the  estimates, 


\\h-h\\(r-S6,p/4  <  (f^  +  2) 


^2jV^+i  ll/ll,., p 
27:6  pL 


2 


and 


||/|lo-3«,p/4  <  2(n  +  2) 


( /8r(7  +  l)>|^2Ar^+i||/|U.p 
y  \  (27r(5)'^+^  /  27:6 pL 


2 


Remark  3.9.  The  important  thing  to  note  is  that  f,  the  amount  by  which  our 
transformed  Hamiltonian  fails  to  be  integrable  is  quadratic  in  the  small  quantity 
WfWl  p.  Just  as  in  Proposition  2.4  in  the  previous  section,  this  will  form  the  basis 
of  a  Newton's  method  argument,  which  will  allow  us  to  prove  the  existence  of  a 
quasi-periodic  solution  with  frequencies  u* . 
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Proof:  Using  Taylor’s  Theorem,  we  can  rewrite 


.  .1=  dS^ .  dS^  j.. 

+f{I,4>)  + j^{^{I  +  t^,<t>),^)dt. 

Prom  the  definition  of  S^,  we  know  that  {a;(/),  +  f{i,<t>)  =  /-(/,</>)  = 

Sin|>M  Thus,  we  can  define 

h{I)  =  hiI)  +  avera9e{JJ^{{-^{I  +  v-^)-^),-^)dvdt} 


df  ~  dS  dS 
+average{  (^(-^  +  +  average/ {I,  (j>{I,  i))  , 


where  average{g(I,  </>)}  =  Jj,ff  g(I,  (j>)d/>,  and 

=  fHi,<j>{l4>)) 


-  averagef{i,  4>{i,  0)). 

Remark  3.10.  Subtracting  the  average  of  the  three  quantities  in  f  insures 
that  when  we  expand  f  in  a  Fourier  series,  there  will  be  no  n  ~  0  coefficient  - 
this  was  used  in  solving  (11). 


Both  h  and  /  are  easy  to  estimate  using  the  estimates  of  Proposition  3.3  and 
Cauchy’s  Theorem.  For  instance, 

2||/|U,^  /8r(7  +  i)\^2jyT'+^||/IU,p 
-  p  V  (27r5)T'+i )  2r^6L 
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Finally,  we  have  the  estimate 

ii/^iu-3W4  <  E  ^  E 

\n\>M  \n\>M 

where  the  last  of  these  inequalities  came  from  using  the  definition  of  M  in  Propo¬ 
sition  3.3. 

If  we  combine  these  remarks,  we  immediately  obtain  the  estimates  stated  in 
the  Proposition. 


□ 

The  Induction  Argument 

The  induction  follows  closely  the  lines  of  the  induction  step  in  the  case  of  the 
circle  diffeomorphisms.  We  have  to  keep  track  of  two  more  inductive  constants 
-  pn  to  control  the  size  of  the  domain  of  the  action  variables,  and  Mn  to  control 
how  we  cut  off  the  sum  defining  at  the  stage  of  the  iteration.  Thus,  we 
define  our  original  Hamiltonian  H{I^(f))  =  HQ{I,(f))  and  set  h{I)  =  hQ{I)  and 
/(/,0)  =  /o  (/,  <?!>)•  Also  define 

•  ^n=  36(l+n^ »  71  >  0. 

•  ao  =  (7,  and  dn+i  =  C7„  —  4(5^,  if  n  >  0. 

•  Pq  <  and  =  Pn/8,  with  po  chosen  to  satisfy  the  hypothesis  of 
the  following  Lemma. 

•  eo  =  ll/IU,p>  and  e„  =  if  n  >  0. 

•  M„  =  |loge„|/(7ri5„). 

We  set  Hn+i  =  o  $„  =  hn+i  +  /n+i,  with  /„+i(/,0)  =  0,  where  is  the 
canonical  transformation  whose  generating  function  solves  the  equation 

with  and  a;„(7)  =  %^(/).  At  the 

stage  of  the  iteration  we  will  work  on  the  domain  =  {(-^j  <t>)  ^  x 

\  \I  ~  In\  <  Pn  ,  \Im{(j)j)\  <  (Jn  ,  i  },  where  In  is  chosen  so 

that  uJn{In)  =  and  we  define  Qn  =  max(l,sup  ||^^||,  "with  the 

supremum  in  these  expressions  running  over  all  I  with  \I  -  In\  <  pn- 
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We  then  have 


Lemma  3.1  (KAM  Induction  Lemma).  There  exists  a  positive  constant  ci 
such  that  if 


Co  <  2 


r(-y  +  i)i6Nn8 


and  po  < 


2-^1  L 


The  generating  function  satisfies 


-l.Pn+l  —  ^n+1* 

^n|Un+l,Pn+l  ^Sn+1- 


||5<||  ,  ^/8r(7  +  l)\^2iV^e„ 

II  nlk„  -  \^(27r5„)7+U  27ri  ' 

•  is  defined  and  analytic  on  Aa^~36n,Pn/4^i^ri)  0,'^d  maps  this  set  into 

•^OTn  -  26„  ,p„  / 2  (^Tl )  • 

•  ll/n+lIU  n+l)Pn+l  —  ^n  +  l* 

•  ll^n+1  ~  ^n||c7n+i,Pn+i  —  ^n+l- 

•  l-^n+l  ~  ^n\  Pn/^’ 


Before  proving  this  lemma,  we  show  how  the  KAM  theorem  follows  from  it. 
If  the  perturbation  /  in  our  Hamiltonian  is  sufficiently  small,  the  hypotheses  of 
the  Induction  Lemma  will  be  satisfied,  and  roughly  speaking,  the  idea  is  that  as 
n  00,  Hn{I,  (j>)  h°°{I),  an  integrable  system,  since  /„  0.  Since  all  of  the 

orbits  of  an  integrable  system  are  quasiperiodic,  this  would  complete  the  proof. 
However,  as  n  becomes  larger  and  larger,  the  size  of  the  domain  in  the  action 
variables  on  which  Hn  is  defined  goes  to  zero.  Thus,  we  must  be  a  little  careful 
with  this  limit. 

Begin  by  defining  ^  o  . . .  By  the  induction  lemma, 

:  Aa^-sSn,Pn/^(^n)  ^  (^o),  and  Hn  =  Hqo  In  particular, 

if  (l>^{t))  is  a  solution  of  Hamilton’s  equations  with  Hamiltonian  then 

^n-i{^^{t),  (l>^{t))  is  a  solution  of  Hamilton’s  equations  with  Hamiltonian  Hq. 
Consider  the  equations  of  motion  of  Hn- 

f  ^fn  :  /  _L 

Since  ||^iU„,p„/2  <  2enN/pn,  and  ||^|U„-«„,p„  <  enN/Sn,  the  trajectory 
with  initial  conditions  (l>o)  (for  any  (j)Q  6  T^),  will  remain  in  ^cr„-36n,pn/4(^T^) 
for  all  times  |^|  <  T„  =  2^,  by  our  hypothesis  on  cq,  and  the  definition  of  the 
induction  constants.  Furthermore,  if  {I'^{t),(p'^{t))  is  the  solution  with  these 
initial  conditions,  we  have 

max  [  sup  |/"(t)-7„|,  sup  |<;i"(t)  -  (w*t  +  <^o)|  )  <  2^"+^ne„Ar/p„5„  . 
|t|<T„  J 

Noting  that  the  inductive  estimates  on  /„  imply  that  there  exists  I°°  with 
liWin-^oo^n  —  we  See  that  for  t  in  any  compact  subset  of  the  real  line, 
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(7^(t),  +  (/)o)  (again  using  the  definition  of  the  inductive  con¬ 

stants).  Using  the  inductive  bounds  on  the  canonical  transformation  one  can 
readily  establish  that 


,Pn  +  l  —  ^2iV 


/8r(7  +  l)\^/  SN^ej  \ 
\{2irSjp+^  J  \27:SjPjLj 


=  A, 


.Pn  +  l 

,Pn+l 


<  (2iV  + 


16A  /8r(7-H) 
pjJ  V(27r(5„)'T+i 


Using  the  definition  of  the  inductive  constants,  we  see  that  the  sum  over  n  of  this 
last  expression  converges  and  hence  limn-.oo  +  ^o)  =  0*(^)) 

exists  and  is  a  quasi-periodic  function  with  frequency  uj*.  Similarly, 

iim,_oo  + M  -  <5fn{i^{t).rm  =  0, 

for  t  in  any  compact  subset  of  the  real  line. 

Combining  these  two  remarks,  find  that 

lim^^oo  ^n(/"(^),  r{t))  =  rm 

so  {I* {t) ,  (I)* (t))  is  a  quasi-periodic  solution  of  Hamilton’s  equations  for  the  sys¬ 
tem  with  Hamiltonian  Hq  as  claimed. 

□ 


Remark  3.11.  Note  that  this  argument  is  independent  of  the  point  0o  that  we 
take  on  the  original  torus.  Thus  it  shows  that  every  trajectory  on  the  unperturbed 
torus  is  preserved. 

Proof:  (of  Lemma  3.1.)  Note  that  Propositions  3.3,  3.4,  and  3.5,  plus  the 
assumption  on  the  induction  constants  imply  that  we  can  start  the  induction, 
provided  v4cro-3<5o,/3o/4(^o)  ^ctupi  (A)*  FVom  the  definitions  of  the  domains  and 
the  inductive  constants,  we  see  that  this  will  follow  provided  |/o  —  Ii\  <  po/S. 
To  see  that  this  is  so  we  note  that  a;o(/o)  =  ^i(^i)-  Thus,  uJo{Io)  —  a;o(/i)  — 
But,  <  m/po,  while 

+  J^  j\^ao  +  sli){h  -  Io)?dsdt  . 

Since  ||  (^)~^  ||  <  ^  and  this  implies  that  |/o  -  /i|  <  Po/8  by  the 

definition  of  the  induction  constants,  provided  <  1/2,  which  will  follow 
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if  the  constant  Ci  in  the  Lemma  is  sufficiently  large.  This  completes  the  first 
induction  step. 

Suppose  that  the  induction  argument  holds  for  n  =  0, 1, . . . ,  iiT  -  1.  To  prove 
it  for  n  =  iiT  we  first  note  if  is  defined  by: 

Sim  =  i  j:  , 

\n\<Mx 

then  by  Proposition  3.3,  we  have 

iic<M  ^  +  2NyeK 

27rL  ' 

Note  that  the  hypothesis  in  Proposition  3.3  becomes  pK  <  L/{2Q.kM]^^) 
where, 


Q.K  =  max(l,  sup  ||— ^ 

\I-Ik\<pk 


\I-Ik\<Pk 


Ck  (■{  i  640A^eo .  -i. 

<nmax(l  +  ^-^,(l-^-^)-i)<20, 

j=i  3=1  n 


using  the  definition  of  the  inductive  constants.  This  observation,  plus  the  hy¬ 
pothesis  on  po  in  the  inductive  lemma,  guarantees  that  the  hypothesis  of  Propo¬ 
sition  3.3  is  satisfied.  Thus,  by  Proposition  3.4,  the  canonical  transformation 
defined  by 


(15)  /  =  /+^,and0  =  0  +  ^^, 

<^9  dl 

is  analytic  and  invertible  on  the  set  Aa^^ssK^PK/^i^K),  and  maps  this  set  into 
Actk  ,Pk  )  • 

If  we  then  define  fx+i  and  as  we  defined  /  and  h  in  Proposition  3.5 

we  see  that 


ll/zC+l  llcTjc— 3(5ic,/!J/c/4  ^  T  2) 


( f  8r(7  +  l)\^  2Ny+^eK\ 

y  V  /  2'k6kPkL  j 


while 


||cr/c— ,Pa:/4  ^  {^K  d"  2) 


(  (8r{'r  +  i)Y  2m+hK\ 

\^\,(2;r5K)T'+V  2w6kPkL  ) 


If  we  use  the  bound  on  cq,  and  the  definitions  of  the  inductive  constants,  we 
see  that  the  quantities  on  the  right  hand  sides  of  both  of  these  inequalities  are 
less  than  e^+i.  The  proof  of  the  inductive  lemma  will  be  completed  if  we  can 
show  that  Aafc+i,pK+i{^K+i)  c  Aaj^~36K,PK/^i^K)^  TMs  follows  in  a  fashion 
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very  similar  to  the  proof  that  C  ^ao-3<5o,Po/4(^o),  which  we  demon¬ 

strated  above,  so  we  omit  the  details. 


□ 

Remark  3.12.  From  the  point  of  view  of  applications  of  this  theory  it  is  of¬ 
ten  convenient  to  know  not  just  what  happens  to  a  single  trajectory,  but  rather 
the  behavior  of  whole  sets  of  trajectories.  Simple  modifications  of  the  preceding 
argument  allow  one  to  demonstrate  the  following  variant  of  the  KAM  theorem. 
(See  [4] .)  Consider  the  family  of  Hamiltonian  systems 

(16)  He==h{I)-\-ef{I,(l))  . 

Suppose  that  there  exists  a  bounded  set  V  C  such  that  is  invertible 

for  all  lev,  and  that  for  every  e  in  some  neighborhood  of  zero  is  analytic 
on  a  set  of  the  form  AaA^)  =  (l>)  e  xC^  \  \I  -  i\  <  p  ,  for  some  I  G 

V  ,and  |/m(0j)|  <  o  ,  j  =  1, . . .  ,iV  }. 

Theorem  3.2.  For  every  ^  >  0,  there  exists  eo  >  0  such  that  if  \e\  <  cq,  there 
exists  a  set  Pe  CV  xT^,  such  that  the  Lebesgue  measure  of  {V  x  T^)\Pe  is 
less  than  6  and  for  any  point  {Io,(l>o)  G  Pe,  the  trajectory  of  (l^)  with  initial 
conditions  {IqjM  is  quasi-periodic. 

Thus  an  informal  way  of  stating  the  KAM  theorem  is  to  say  that  most  tra¬ 
jectories  of  a  nearly  integrable  Hamiltonian  systems  remain  quasi-periodic. 

Remark  3.13.  Just  as  in  the  case  of  Arnold’s  theorem  about  circle  diffeo- 
morphisms,  the  KAM  theorem  also  remains  true  when  the  Hamiltonian  is  only 
finitely  differentiable,  rather  than  analytic.  For  a  nice  exposition  of  this  theory, 
see  [11]. 
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KAM  theory  in  infinite  dimensions 

WALTER  CRAIG 


Abstract.  Many  of  the  nonlinear  evolution  equations  of  mathematical 
physics  can  be  viewed  as  infinite  dimensional  Hamiltonian  systems,  posed 
in  a  function  space  as  a  phase  space.  This  includes  nonlinear  Schrodinger 
equations,  nonlinear  wave  equations,  and  nonlinear  systems  related  to  the 
KdV  and  the  water  wave  problem.  There  is  a  recent  class  of  results  whose 
intent  is  to  describe  some  of  the  principal  features  of  the  phase  space  for 
these  equations,  through  extensions  of  KAM  theory  to  infinite  dimensional 
settings.  The  first  part  of  this  hour  talk  will  consist  in  a  survey  of  the  results 
of  several  authors  on  this  theme.  The  second  part  will  go  into  depth  on 
some  aspects  of  the  construction  of  invariant  tori  for  nonlinear  evolution 
equations.  The  talk  will  illustrate  the  connection  between  invariant  tori 
and  Anderson  localization  in  the  space  of  normal  modes,  and  will  discuss 
the  development  of  unstable  periodic  orbits  in  the  presence  of  resonance. 


§1.  Introduction 

It  IS  natural  to  feel  that  we  understand  ODE  in  much  more  detail  than  PDE, 
and  therefore  it  is  a  natural  impulse  to  regard  PDE  which  describe  time  depen¬ 
dent  phenomena  as  dynamical  systems,  posed  in  an  infinite  dimensional  phase 
space.  As  well  as  being  an  elegant  point  of  view,  it  is  a  successful  one  if  it  leads 
to  a  better  understanding  of  the  solutions  of  the  equations.  The  problems  that  I 
will  be  addressing  today  are  mainly  evolution  equations  which  are  Hamiltonian 
systems  with  infinitely  many  degrees  of  freedom.  In  the  last  several  years  there 
has  been  a  research  effort  to  extend  one  of  the  main  tools  of  finite  dimensional 
Hamiltonian  mechanics,  the  KAM  theory,  to  infinite  dimensional  settings.  The 
mam  goal  is  to  understand  some  of  the  principal  features  of  the  phase  space  in 
which  the  Hamiltonian  PDE  are  posed.  I  will  be  describing  the  work  of  several 
people  in  this  talk;  my  own  work  on  the  subject  has  been  in  collaboration  with 

E.  Wayne,  who  is  lecturing  on  finite  dimensional  KAM  theory  in  this  series  of 
talks. 
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I  will  describe  a  class  of  results  on  the  existence  of  invariant  tori  for  PDE, 
considered  as  infinite  dimensional  Hamiltonian  systems.  For  initial  data  on  these 
tori,  the  flow  of  the  Hamiltonian  system  corresponds  to  solutions  of  the  PDE 
which  are  time  periodic  and  time  quasi-periodic,  that  is  they  behave  regularly 
in  time.  These  solutions  contrast  in  character  with  those  exhibited  in  Dave 
McLaughlin’s  talk;  numerical  solutions  of  a  simultaneously  damped  and  driven 
version  of  the  nonlinear  Schrodinger  equation,  which  apparently  undergo  irregu¬ 
lar  behavior  when  tracked  over  long  time  intervals.  The  impression  that  we  have 
received  in  the  last  two  talks  on  the  KAM  theory  is  that  these  invariant  tori 
are  somewhat  delicate  structures,  depending  on  diophantine  properties  of  the 
frequency  vector  and  concerning  the  degree  of  smoothness  of  the  Hamiltonian 
system.  It  is  true  that  the  analysis  by  which  we  construct  them  is  detailed.  On 
the  other  hand  the  fact  that  they  exist  is  very  robust,  in  the  sense  that  they 
are  present  for  a  large  class  of  Hamiltonian  systems,  and  we  will  see  today  that 
there  are  invariant  tori  present  in  the  phase  space  of  a  large  class  of  PDE.  Indeed, 
if  we  somehow  consider  the  space  of  nonlinearities,  say,  for  the  nonlinear  wave 
equation,  then  most  -  in  the  sense  of  an  open  dense  set  -  do  possess  invariant  tori 
in  their  phase  space.  Failure  to  have  these  structures  can  only  occur  when  there 
is  either  (i)  uncharacteristically  severe  linear  resonance,  or  (ii)  lack  of  genuine 
nonlinearity  in  the  problem.  But  I  am  getting  ahead  of  myself,  and  will  start  by 
giving  you  the  examples  I  have  in  mind, 

§2.  Hamiltonian  systems 

This  is  a  selection  of  the  most  basic  PDE  that  come  to  mind  when  working 
on  the  infinite  dimensional  extensions  of  the  KAM  theory.  The  first  one  that 
comes  to  mind  is  the  nonlinear  wave  equation 

(1)  =  dlu  -  g{x,  u)  ,  0  <  a::  <  tt 

with  either  Dirichlet  boundary  conditions 

1/(0,  t)  =  0  =  tt(7r,  t)  (Dir.) 

or  periodic  ones 

u(x,  t)  =  u{x  -h  TT,  t)  (Per.)  . 

This  is  a  second  order  equation,  but  may  be  made  into  a  first  order  Hamiltonian 
system  with  the  following  Hamiltonian  function, 

H{u,p)  =  f  ^p{xf  +  +  G{x,u)  dx 

Jo 


where 


duG{x,u)  =  g{x,u)  =  g\{x)u  +  g2(x)u^  +  gz{x)u^  ^ - 
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p)  V-1  ^)\SpH{u,p) 


The  canonical  equations 


axe  easily  seen  to  be  equivalent  to  (1).  We  will  be  assuming  in  our  later  analysis 
the  technical  conditions  that  g{x,  u)  is  analytic  in  (x,  u)  and  periodic  in  x,  and 
if  we  consider  Dirichlet  boundary  conditions  (Dir.),  that  g{x,u)  =  -g[-x,~u). 
The  next  example  is  the  nonlinear  Schrodinger  equation, 

(2)  =\dl'rp-  Q{x,  0<x<2n 

where  w  also  consider  either  boundary  conditions  (Dir.)  or  (Per.).  Letting 

G{x,'ip,  il>)  be  a  real  valued  function,  such  that  d^G  =  Q,  this  equation  has  the 
Hamiltonian 

J  0 

and  equation  (2)  can  be  written  as 

with  J  =  i  representing  the  symplectic  form. 

The  third  natural  class  of  examples  is  equations  of  KdV  type, 

9tg  =  -  d^(g(x,  q)),  0<x<2n 


which  have  Hamiltonian 


p27r 

~  G(x,q)  dx 

do 


where  dqG  —  g.  The  equation  (3)  can  be  rewritten 

dtq=J6qHKdv{q). 

with  this  time  J  =  — (9^,  given  by  a  nonclassical  symplectic  structure. 

Related  m  origin  to  the  KdV  equation  is  the  Boussinesq  system  given  through 
the  Hamiltonian 

p2TT 

HB{g,p)  =  /  ip"  +  ig"  -  i(a,9)"  +  G{x,p,  q)  dx 
J  0 


which  results  in  the  system  of  PDE 


0  d, 
0 


p  +  dj,G{x,p,  q) 
g  +  \dlq  +  doG{x,p,q) 
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The  system  that  is  given  here  is  ill  posed  as  an  initial  value  problem,  and  it  is 
not  one  of  the  examples  that  we  will  pursue  in  this  talk. 

Finally  I  want  to  mention  one  of  my  favorite  systems,  albeit  one  of  the  least 
tractable,  the  water  wave  problem.  This  is  the  system  that  describes  the  evo¬ 
lution  of  surface  waves  in  a  body  of  an  inviscid  incompressible  fluid  which  is  in 
addition  taken  to  be  irrotational.  It  can  be  written  in  Hamiltonian  form 


where  the  Hamiltonian  comes  from  an  expression  originally  given  by  V.E.  Za¬ 
kharov.  For  the  most  convenient  formulation  for  (see  Craig  k  Sulem  [CS]) 
consider  potential  flow  in  a  fluid  region  R{t})  =  {0  <  a:  <  27r,  —h  <  2/  ^ 
which  means  that  the  velocity  potential  satisfies 

A(l){x,y)=0,  dy(l){x,-h)=0 

in  R{7]).  Letting  (f){x,rj{x))  =  ^{x)  be  the  Dirichlet  data  for  (j)  on  the  top 
boundary,  N  the  outward  pointing  unit  normal  to  the  free  surface,  and  G{rj)^  = 
V0  •  N-^Jl  +  fhe  Dirichlet-Neumann  mapping,  then  the  Hamiltonian  is 

given  by  the  expression 

n2'K 

(6)  +  §7?^  dx  . 

JO 

With  our  present  knowledge,  system  (5), (6)  is  too  difficult  to  fit  into  our  analytic 
framework.  It  does  remain  one  of  our  goals,  as  a  mathematically  challenging  and 
physically  relevant  open  problem. 

Exercise  1:  Show  that  equations  (1)  through  (5)  are  in  their  claimed  form 
as  Hamiltonian  systems,  with  the  given  Hamiltonian  functions  and  symplectic 
structures.  Do  the  linear  stability  analysis  about  the  zero  solution,  and  explain 
that  equations  (2)  (3)  and  (5)  are  linearly  stable,  (1)  is  linearly  well-posed,  and 
that  (4)  is  linearly  ill-posed. 


§3.  Results 

I  plan  to  explain  the  majority  of  the  results  in  the  context  of  the  nonlinear 
wave  equation  (1).  The  Hamiltonian  can  be  written 

H{u,p)  =  H2{u^p)  +  Hs^u)  -h  iJ4(u)  H 

where  He  is  homogeneous  of  degree  i  in  its  arguments.  Truncate  to  retain  only 
H2{u,p)  =  f^p^-h  l(da:uf  +  lgi(x)u^dx.  The  resulting  system  is  of  course  the 
linearization  of  (1)  about  the  equilibrium  solution  u  =  0, 

d^v  =  d^v  -  gi(x)v  . 


(7) 
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Using  the  eigenfunctions  {ipj{x)}  and  the  eigenvalues  {cjj}  of  the  operator 
H91)  =  —{d/dx)^  -h  9i{x)  with  the  appropriate  boundary  conditions  (Dir.)  or 
(Per.),  a  time  periodic  solution  to  (7)  is  given  by 

(8)  Vj  {x,  t)  =  r  cos{u;jt  +  ^)^j  (x) 

with  frequency  ujj  and  ‘action’  r^.  We  assume  for  convenience  that  all  eigen¬ 
values  satisfy  >  0;  in  any  CcLse  there  will  be  only  finitely  many  exceptions  for 
each  choice  of  gi,  and  the  rest  of  the  analysis  will  work  independently  of  them. 
The  general  solution  for  7.^(0,  tt)  initial  data  is 

00  00 

v{x,  t)  =  J2rj  cos{ujt  +  r?  <  +00  , 

j=l 

which  is  time  quasiperiodic  (QP)  if  the  set  of  frequencies  {uj,;  ^  0}  has 

a  finite  frequency  basis  over  the  rationals.  In  general  such  an  expression  is  al¬ 
most  periodic  (AP)  in  time.  Our  main  question  is  whether  any  of  these  periodic, 
quasiperiodic  or  almost  periodic  solutions  persist  for  the  nonlinear  problem  (1). 
For  most  nonlinearities  the  answer  is  yes,  at  least  for  some  of  the  above  mo¬ 
tions.  Let  us  focus  on  periodic  motions  with  frequency  close  to  one  of  the  linear 
frequencies  ujj^ . 


Theorem  1.  (Craig  &  Wayne  [CWl]  (1993))  There  is  a  set  G  in  the  space 
of  all  nonlinearities  such  that  if  g  e  G  then  there  exists  a  smooth  Cantor  family 
of  periodic  solutions  to  (1).  More  specifically,  there  exists  a  tolerance  r^, 
functions  {u{x,  t]  r),  Q{r))  for  r  G  [0,  ro),  and  a  Cantor  set  C  such  that  for  r  eC, 
then  u{x,t\r)  is  a  time  periodic  solution  to  (1)  with  frequency  0(r).  Moreover, 
u  is  analytic  with  respect  to  {x,t),  and  with  a  choice  of  phase 

\u{x,t;r)-rcos{n{r)t  +  ^)il;j^{x)\  < 

|n(r)-a;,-J  <  qrp  . 

The  nonlinear  solution  is  given  as  a  graph  over  the  unperturbed  orbits  of  the 
linear  solution  of  (7),  forming  a  Cantor  set  foliated  by  invariant  circles. 
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The  set  of  acceptable  nonlinearities  Q  is  large  in  the  set  of  analytic  nonlinear¬ 
ities,  indeed  it  is  open  and  dense.  In  fact  the  conditions  defining  Q  depend  only 
upon  the  coefficients  g\{x)^g2{'^)  and  ^3(^)5  3-jet  of  g.  The  condition  on  gi 
implies  that  the  linear  frequency  sequences  {coj}  avoid  exceptionally  resonant 
cases,  and  those  on  {gi,g2,g3)  ask  that  the  problem  be  genuinely  nonlinear.  For 
example,  the  potentials  gi{x)  =  m?  a  constant,  g2  —  ^  and  g^(x)u^  —  are 
included  in  Q  for  all  ao  7^  0  and  for  an  open  and  dense  set  of  parameters  w?  of 
full  Lebesgue  measure. 

This  result  has  a  direct  analog  when  applied  to  the  nonlinear  Schrodinger 
equation  (2),  where  the  behavior  of  the  frequency  sequence  c\j\^  <  Uj  <  C|ip 
makes  certain  aspects  of  the  problem  somewhat  easier,  [CW4].  I  will  give  more 
details  of  the  proof  of  Theorem  1,  which  is  by  an  approach  that  is  nontraditional 
in  KAM  theory.  First  however  I  want  to  paraphrase  several  earlier  results  along 
similar  lines. 

Theorem  2.  (Kuksin  [K]  (1987)(1993),  Wayne  [W]  (1990))  There  is  a  set 
Gi  such  that  if  g  eGi  then  equation  (1)  with  Dirichlet  boundary  conditions  has 
Cantor  families  of  quasiperiodic  solutions. 

The  version  of  E.  Wayne  gives  Q\  as  a  subset  of  the  analytic  nonlinearities 
which  is  not  dense,  although  it  is  of  large  measure  for  an  appropriate  sense  of 
measure.  S.  Kuksin ’s  version  of  this  theorem  is  through  the  study  of  equations 
of  the  form 


(9)  dtu  =  iuj{D,  a)w  -f  /  ,  0  <  a;  <  27r 

with  u}{j,  a)  ^  |j|P,  and  nondegenerate  in  a  certain  sense  with  respect  to  a  finite 
dimensional  set  of  parameters  a.  The  wave  equation  (1)  with  Dirichlet  boundary 
conditions  can  be  shown  to  fit  into  his  framework,  as  can  (2)  and  (3),  however  the 
frequency  asymptotics  exclude  the  resonances  and  near  resonances  of  the  periodic 
boundary  conditions  (Per.).  In  fact,  a  recent  preprint  of  Kuksin  &  Poschel 
[KP]  shows  that  the  nonlinear  Klein- Gordon  equation  with  Dirichlet  boundary 
conditions  fits  into  the  framework  of  (9),  using  in  part  a  global  transformation 
to  fourth  order  Birkhoff  normal  form. 

The  methods  developed  for  the  proof  of  these  results  also  provide  a  lot  of  detail 
about  these  solutions  to  nonlinear  wave  equations.  One  result  which  builds  upon 
Theorem  1  is  that  linear  resonance  produces  instabilities. 

Definition  3.  A  set  {oJj^}  of  linear  frequencies  is  in  resonance  if  there  is  a 
relationship 

_  _  ■_ 

k2  "  ’ 

for  integer  pairs  {ji,  kf). 

An  example  of  a  two  degree  of  freedom  system  in  linear  resonance  a;i  :  0^2  = 
1  :  1  is  given  by  the  Hamiltonian 

(10)  H{x,y)  =  \{yl  +  yl)  +  ^(.x\+xl)  +  ao{x\+xlf -^aix\  +  a2xi  , 
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for  {xi,X2,yi,y2)  e  R^. 

Exercise  2:  Find  the  families  of  time  periodic  solutions  of  the  Hamiltonian 
system 

X  =  dyH  ,  y  =  -d^H  , 

with  frequency  close  to  the  linear  frequency  ui.  Calculate  the  linear  stability  of 
these  periodic  solutions. 

Equation  (10)  is  a  model  problem  for  the  nonlinear  Klein-Gordon  equation 
(11)  d^u  =  d^u  —  m^u  +  (1  +  ao(a;))w^  ,  u(x,  t)  =  u{x  +  27r,  i) 
for  which  there  is  a  1  :  1  resonance  for  every  nonzero  frequency  =  ojoj  = 

Theorem  4.  (Craig,  Kuksin  &  Wayne  [CKWj  (1994))  For  the  coefficient  in 
equation  (11)  take  ao{x)  =  acos{2jx).  Then  there  exist  smooth  Cantor  families 
of  periodic  solutions  which  have  at  least  two  normal  hyperbolic  directions. 

Here  is  a  schematic  picture  of  the  situation: 


It  seems  to  be  a  general  principle  that  in  the  infinite  dimensional  phase  space 
for  these  PDE,  the  traveling  wave  solutions  (or  their  perturbations)  are  stable, 
elliptic  periodic  orbits,  while  the  standing  wave  solutions  are  unstable  or  unstable 
after  perturbation,  periodic  orbits  with  at  least  two  normal  hyperbolic  directions. 
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§4.  Small  divisors 

We  will  now  get  to  work  constructing  some  of  these  solutions.  In  order  to 
solve  (1)  we  take  the  point  of  view  of  embedding  a  torus  into  our  function  space 

oo  oo  _ 

(12)  s{x,^) = ■ 

j=i  j=ikez^ 

We  ask  that  $  €  T^,  so  that  Sj{^  +  7)  =  Sj{^)  for  7  C  Z^.  For  this  to  be  a  real 
torus,  the  sequence  is  to  satisfy  the  reality  condition  s{j,k)  =  s{j,—k).  If  the 
embedding  is  to  be  invariant  under  the  flow  of  equation  (1),  and  if  the  flow  is 
to  pull  back  to  the  linear  flow  ^  ^  on  the  torus  T^,  the  function  S{x,  f) 

must  satisfy 

(13)  {Q^d^fS-dlS  +  g{x,S)  =  0 

In  terms  of  the  eigenfunction  expansion  for  S  in  (12)  the  sequence  of  Fourier 
coefficients  {{j,  k)}  must  satisfy  an  equation  on  the  lattice 

(14)  m  •  k)^  -  u;])s{j,  k)  +  W{s){j,  k)  =  0 

with  a  nonlinear  (and  nonlocal  of  course)  function  W{s)  defined  through  the 
nonlinearity  g,  which  satisfies  in  some  appropriate  norm  ||W(s)||  <  0(||s|p). 

We  will  continue  this  discussion  for  the  case  of  time  periodic  solutions,  setting 
the  torus  dimension  N  =  I  and  linearizing  (14)  about  zero.  We  find 

(15)  {{n-k)^  -uj])s{j,k)  =  0 

with  one  solution  so{j,  k)  =  —  jo,  k  —  ko)  +  —  jo,  k  +  ko), 

and  fl  =  (wj^/ko),  corresponding  to  the  solution  (8)  of  the  linear  wave  equation 
(7),  ^(a;,^  =  rcosikoOi’joi^)- 

Exercise  3:  Make  this  correspondence  between  the  PDE  and  the  lattice  problem 
for  the  general  quasiperiodic  solution. 

Assume  that  there  is  no  exact  linear  resonance  for  (^j^,  or  that  there  is  at 
most  a  finite  resonance,  so  that  the  set  N  =  {(j>,  ±ke)]  {(^je/ke)  =  a;jo}  ^  Z^  is 
finite.  Let  Q  be  the  orthogonal  projection  onto  the  subspace  in  £^(Z^),  the 

subspace  of  the  linearly  resonant  modes,  and  let  P  =  1  -  Q.  Then  the  solution 
to  equation  (14)  can  be  decomposed  as 

(16)  s(i,  k)  =  {Qs  +  Ps)(j,  k)  =  so(j,  k)  +  u{j,  k)  . 

This  organizes  the  equation  (14)  as  a  Lyapunov- Schmidt  decomposition  into 
resonant  and  nonresonant  components; 

(17)  diag  {{Qk)'^  —  lo^)n  sq  +  QW{so  +  u)  =  0 
diag  ((r^fc)^  —  ^j)z2\jv  +  PTT(so  +  u)  =  0  . 


(18) 
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The  classical  Lyapunov-Schmidt  method  would  be  to  solve  (18)  for  a  sequence 
u  =  u(j,  h,  So,  Q),  and  then  to  find  (sq,  ft)  to  satisfy  (17).  The  latter  is  the  analog 
of  the  ‘frequency  map’  of  the  classical  aproach  to  the  KAM  theory. 

The  small  divisor  problem  is  already  evident  in  the  diagonal  elements  of  the 
operator  K(f2)  =  diag  ((OA;)^  —  w?).  The  standard  procedure  to  solve  (18)  might 
be  to  use  the  inverse  V(f2)  *  =  diag((nA:)^  —  a;?)~*,  however  for  typical  fre¬ 
quency  sequences  {uj}  and  most  Q,  the  eigenvalues  of  K(n)  (simply  its  diagonal 
elements)  are  dense  in  R,  and  in  particular  they  accumulate  at  zero.  To  overcome 
the  inherent  losses  in  the  problem,  we  will  adopt  the  Nash-Moser  modification 
of  the  Newton  iteration  method  to  solve  (18). 

Let  Bn  =  \j\  -|-  |fc|  <  Lo2”}\Af  C  be  a  sequence  of  lattice  do¬ 

mains  which  eventually  exhaust  Z^.  Then  we  propose  to  solve  the  approximate 
equations 

{V{n)+dnPWiso  +  Un))B„Vn  =  -(K(n)u„  -|-  PW{so  -|-  U„))b„ 

(19)  Un-\-i  —V,n  “h  Vn 

which  is  Newton’s  method  F'(u„)i;„  =  -F(u„),  u„+i  =  -f-  v„,  restricted  to 

the  sequence  of  approximate  domains  Bn-  Now  we  may  take  advantage  of  the 
rapid  convergence  properties  of  the  iteration.  However  the  problems  with  the 
linearized  operators  are  somehow  worse  even  than  F(^l)-^  as  we  have  to  find 
the  inverse  of  (K(fl)  -t-  P9„W^)b„,  which  has  off-diagonal  terms  as  well. 

There  is  a  strong  analogy  with  known  problems  in  statistical  mechanics  and 
locahzation  theory.  There  the  issue  is  the  competition  between  the  closeness 
to  resonance  of  random  potential  wells,  which  enhances  tunnelling,  and  the  dis¬ 
tance  between  them,  which  supresses  it.  To  emphasize  this  in  our  small  divisor 
problems,  we  will  use  the  terminology  that 

Hb„  =  Hb„  (n,  u)  =  (F(0)  -b  dnPW(so  +  «))b„ 
is  the  ‘Hamiltonian’  operator,  and  its  inverse 

Gb„  =  (F(0)  -f  duPW(so  +  n))B‘ 

is  the  ‘Green’s  function’.  Typically  in  our  KAM  problems,  K(n)  is  diagonal  and 
has  small  divisors  recognizable  by  the  fact  that  its  spectrum  is  dense  at  zero,  and 
dnPW  gives  rise  to  off-diagonal  terms  which  decay  in  the  off-diagonal  distance. 
Also  typical  is  the  fact  that  for  any  subdomain  A,  Ha  is  self-adjoint. 

§5.  The  Green’s  function 

The  last  section  of  this  lecture  is  a  description  of  our  analysis  of  the  Green’s 

function  that  is  at  the  heart  of  the  convergence  proof  for  the  iteration  scheme 
(19). 
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Definition  5.  A  lattice  site  x  =  (j,  k)  e  is  singular  if 
lV(n)l  =  |(D2fc2_w|)l<c(, 

and  otherwise  it  is  regular.  A  singular  region  is  a  group  of  singular  sites,  which 
we  usually  take  to  he  connected. 

Consider  a  selfadjoint  operator  W  on  a  lattice  region  A  C  (which  in  the 
course  of  our  iteration  will  be  the  restriction  to  A  of  duPW). 

Lemma  6.  If  A  is  a  region  containing  no  singular  sites,  and  if  i|W^'||op  < 

r  <  d,  then  the  Green’s  function  Ga  =  +  WT"  exists,  and  satisfies 

IIGaIIop  <  ^  ■ 

Proof.  The  Neumann  series  for  Ga  converges  in  operator  norm, 

OO 

= (y(n) + • 

}=o 

Our  approximate  solutions  u„  correspond  to  analytic  functions  of  (x,  0  >  there¬ 
fore  we  may  consider  sequences  which  decay  exponentially,  measured  with  the 


I|m||'  =  E  <  +°°  ■ 

(j,fc)=a:€Z2 

To  go  along  with  this,  we  manufacture  an  operator  norm 

\\W'\\a=  sup  Yl,  \W(x,y)\e‘’\’’’-'’^'‘  . 

Operators  with  finite  <r-norm  have  matrix  elements  which  decay  exponentially 

off-dli«o»al;  tl»t  I.,  If  Ill'll,  <  r,  then  \W'Ml  £ ■  F""*™ 

the  norm  has  the  algebraic  property  that  |ltPiW^2||<r  <  ||W^i||<7l|M^2lU.  and  Lemma 

6  works  equally  well  with  respect  to  this  norm  as  with  the  regular  operator  norm. 

The  diagonal  elements  of  F(n)  are  dense,  so  it  is  clear  that  there  will  be 
singlar  sites  in  for  n  large,  and  the  hypotheses  of  Lemma  6  will  not  hoW. 
What  we  must  do  is  to  insure  that  some  control  of  the  Green’s  function  on 
will  remain.  The  goal  is  to  show  that  for  a  Cantor  set  of  parameters  (r,  fi),  there 
is  an  inductive  choice  (r/2  <  (T„  <  cr  such  that  |1Gb„  ||<t„  <  l/^m  and  furthermore 
all  of  the  other  inductive  constants  stay  under  control. 

To  illustrate  the  methods  to  control  the  Green’s  function,  I  will  work  with  a 
model  problem,  which  will  have  the  full  character  of  the  estimates  at  one  iteration 
step.  We  make  the  following  assumptions  on  the  singular  sites  of  a  domain  B 

and  their  geometry.  .  ,  •  c 

(1)  The  singular  sites  of  B  are  contained  in  a  set  of  singular  regions  bj,  so  tnat 


UjSj  contains  all  of  the  singular  sites 
B\  Uj  Sj  contains  regular  points  only. 
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(2)  The  singular  regions  are  of  bounded  size, 

diam  (Sj)  <  Cq  . 

(3)  For  singular  regions,  the  Hamiltonian  operator  has  spectrum  bounded  away 
from  zero, 

dist  (spec  0)  >  ^  , 

where  S  «  d.  In  practice  in  the  induction,  the  tolerance  for  small  denominators 

d  —  6fi  — ^  0. 

(4)  Singular  regions  are  separated  from  each  other 

dist  {Sj,  Sm)  >  3£  . 

In  the  induction,  i  =■  will  be  a  divergent  series,  but  will  grow  less  rapidly  than 
the  radius  of  the  regions 

(5)  Finally,  we  consider  disks  C{Sj)  of  radius  I  about  each  singular  region  Sy 
By  requirement  (4)  these  are  pairwise  disjoint.  We  also  ask  that  Ec{s  )  has 
spectrum  bounded  away  from  zero, 

dist  (spec(i?c(s,)),0)  >  b  . 

The  singular  regions  of  the  lattice  T?  will  look  like  this: 


k 


j 


figure  3 


Conditions  (1)  through  (5)  will  not  hold  for  all  of  the  values  of  the  parameters 
(r,0).  At  the  nth  iteration  step,  the  parameters  for  which  any  of  (1)  tc  (5) 
fail  must  be  excised  from  consideration  and  the  approximate  solution  smoothly 
interpolated  through  the  excision.  This  is  the  origin  of  the  Cantor  nature  of  the 
sets  on  which  convergence  of  the  procedure  is  achieved,  and  the  but  not 
analytic  nature  of  the  bifurcation  branches. 
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Theorem  7.  If  the  above  hypotheses  are  satisfied,  ||TV'||ct  <r  «  d  and  <5,7 
and  e  are  chosen  appropriately,  then  the  Green’s  function  Gb  exists,  and  satisfies 

(20)  i<;i.(*,»)i  <  j  (f  ^ 

Proof:  We  will  use  a  resolvant  expansion  for  Gb^  based  on  the  block  diagonal 
decomposition  of  the  Hamiltonian 

(21)  Hb  =  Ha  Hsj  +  F  , 

where  F  consists  of  the  off-diagonal  coupling  terms  which  involve  lattice  sites  in 
A  coupling  to  those  in  Sj,  and  couplings  between  different  5/s.  In  particular,  if 
both  x,ye  A  or  else  x,y  e  5j,  then 

F(x,2/)  =  0  . 

Inverting  (21),  the  generalized  resolvant  identity  for  this  block  decomposition  is 
that 

(22)  Gb  =  G A  ©i  +  G a  Gs^^Gb  -> 

which  we  may  iterate  to  arrive  at  the  formal  expression, 

oo 

(23)  Gb-G  A®  3  Gsj  +  Gsj  (FGa  %  Gs^ )  • 

m=l 

The  object  is  to  prove  that  under  assumptions  (1)  through  (5)  this  expression  is 
convergent. 

The  first  step  will  be  to  see  that  the  term  which  involves  no  coupling  of  the 
blocks  obeys  the  correct  estimate  (20). 

<  forx,y^A 

d 

\GA®jGs,{x,y)\  <  ^  forx.yeSj 

—  0  if  x,y  are  not  in  the  same  block 

This  is  in  agreement  with  the  bound  (20). 

The  expression  involving  one  coupling  operation  between  blocks  is 

Ga  Gsj^Ga  Gsj  ? 

which  we  will  consider  by  cases.  Since  there  are  potentially  two  occurrances  of 
Gsj,  the  object  will  be  to  avoid  factors  of  1/6'^  in  the  estimate,  which  for  6  «r 
would  call  into  question  the  convergence  of  the  series. 
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In  case  x  £  A  and  y  G  Sj  (or  vice  versa), 

|G>i®jGsj.rG^©^  G5^.(a;,j/)|  =  |  GA{x,z)T{z,w)Gsj{w,y)\ 

zeA 

—  Z— ^  d  fi  ' 


Diminishing  the  exponential  rate  of  decay  by  a  small  amount  7, 


d6  ^ 

z^A 

w£Sj 

<  i  ^  ^ 

~6\dJ  ' 

where  we  used  that  \w  -  y\  <  Co  for  both  w,y  e  Sj. 

In  case  that  both  x,  ^  6  A,  or  x^^y  E  Sj  the  same  block,  we  have  Ga  0j 
Gsj^Ga  Gsj  {x,  y)  =  0,  since  the  interaction  operator  F  strictly  enforces  that 
the  lattice  sites  {z^^w)  be  in  distinct  blocks.  The  final  case  exhibits  the  need  for 
assumption  (4);  we  take  a;  E  5^  and  y  E  *9^,  j.  Then 

I  5^Gs,(x,2)r(z,w)Gs,K2/)|  <  I 


\ 


-jIz-wI 


Since  z  E  Sj  and  w  e  Sk  with  k  ^  j,  there  is  a  jump  between  lattice  sites 
included  in  the  exponential  sum  of  at  least  distance  I,  and  if  we  demand  that 


<  1 

0 


the  result  is  that 


<  G-e-^^/2  <  G  , 
0 


so  that  in  this  case  too  the  bound  is 

The  factors  (r/d)""  <  1  will  be  the  convergence  factors  in  the  sum  (23).  In 
order  to  continue  the  process  of  estimates  of  the  terms  of  it,  we  are  forced  to 
study  both  of  the  local  Hamiltonians  Hsj  Q-nd  The  following  result 

illustrates  the  role  played  by  the  assumption  (5). 
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Lemma  8.  Under  the  above  hypotheses,  the  local  Hamiltonians  in  a  neighbor¬ 
hood  of  a  singular  region  S  satisfy 

(26)  |Gom(*,»)l  <  I  . 

Proof:  If  both  x,y  e  S,  using  assumption  (5)  and  the  fact  that  Hc(s)  is  self- 
adjoint,  ^ 

\Gc{s){x,y)\< ^ 

which  is  a  sufficient  for  (26)  as  diam(5)  is  bounded.  However  if  one  of  x,y  ^  S 
this  simple  bound  will  not  give  us  an  estimate  of  exponential  decay.  To  do  better 
we  again  proceed  with  the  resolvant  identities, 

Hc(s)  =HD®Hs  +  r  , 

where  D  =  C{S)\S.  The  Green’s  function  will  satisfy 

Gc(s)  —Gd  ®  Gs  +  Gd  ®  Gs^Gc(s) 

(27) 

=  Gd®Gs  +  Gd®  GsTGd  ®Gs  +  Gd®  GsVGc(s)^Gd  ®  Gs  ■ 

When  X  ^  5  but  y  e  5, 

Gc{S)  ~  Gd  ®  Gs  +  Gd  ®  Gs^Gc(^s)  > 

and  the  first  term  vanishes  as  x,y  are  not  in  the  same  block.  The  second  term 
is  controlled  by  an  exponential  sum,  giving 


<  itl  1  l-S 


6  \'rP/2 


■(a-'y)\x-y\ 


which  is  sufficient  for  (26). 

If  both  x,yeD,  then  we  need  to  use  the  second  expression  in  (27).  The  first 
term  does  not  involve  S,  and  is  bounded  safely  by  (l/d)e  using  Lemma 

6.  The  second  term  vanishes  as  the  interaction  T  demands  a  change  of  block, 
and  we  are  left  with; 

|Gc(S)(®.l/)l  ^ 

+  I  GD{x,z)T{z,w)Gc{s)(w,p)T{p,q)GD{q,y)\ 


z£D  wES 
pes  q£D 


d 


^  ±^-(T\x-y\ 

-  d 


\x-z\--<r\z— 


\  -{a-'))\x-w\  -~{cT-^)\p-y\ 
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Since  in  this  expression  we  have  both  w^p  6  the  distance  between  them  is 
bounded,  and  again  we  satisfy  the  criterion  of  the  lemma,  which  finishes  the 
proof. 

This  almost  completes  the  proof  of  Theorem  7,  which  needs  only  one  more 
step.  To  make  a  good  estimate  of  the  terms  of  the  resolvant  expansion  of  degree 
m  >  2,  we  use  a  slightly  more  sophisticated  version  of  the  resolvant  expansion, 
in  which  we  alternate  the  generalized  resolvant  identity  with  respect  to  the  de¬ 
compositions  B  =  AUj  Sj  and  B  =  A'  Uj  C{Sj),  where  A'  =  B\  Uj  C{Sj).  That 
is,  we  can  write 

—  Ha'  ® ^c(Sj )  + 

and  therefore  alternating  the  use  of  the  two  decompositions,  we  write 
—  Ga  Gsj  +  Ga  GsjTGa*  Gc{Sj) 

oo 

+  [Ga  Gs^TGa'  0j 

m=\ 

(28)  X  {Ga  Gs,  +  Ga  Gs, TGa'  Gc(s,))  ■ 

The  convergence  of  this  series  with  uniform  exponential  bounds  gives  the  proof 
of  Theorem  7,  and  this  in  turn  can  be  easily  deduced  from  the  following  Lemma. 

Lemma  9.  Estimates  on  the  principal  term  of  the  series  (28). 

|Ga  ©,•  Gs,rG^,  Gc(s,)r'(x,2/)|  <  . 

The  proof  of  this  is  of  similar  character  as  the  above  arguments,  in  which  we 
consistently  avoid  factors  of  (l/<5)  which  would  ruin  the  convergence  estimates. 
We  do  this  by  insuring  that  each  summand  associated  with  a  lattice  site  in  either 
Sj  or  C{Sj)  must  enter  the  sum  associated  with  a  long  path  [x\z\w\p\q]r\y) 
from  a;  to  y  of  length  at  least  L  The  estimate  is  then  similar  to  (24), (25). 

I  will  leave  the  precise  details  of  the  argument  to  the  reader,  as  a  somewhat 
combinatorial  exercise. 
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GLOBAL  CENTER  MANIFOLDS  AND 
SINGULARLY  PERTURBED  EQUATIONS: 
A  BRIEF  (AND  BIASED)  GUIDE 
TO  (SOME  OF)  THE  LITERATURE 


Nancy  Kopell 


Center  manifolds  arise  when  a  critical  point  of  a  differential  equation  has  k  >  1 
eigenvalues  with  zero  real  part.  The  center  manifold  has  dimension  k  and  is  tangent 
to  the  eigenspace  of  the  k  eigenvalues  with  zero  real  part  [1]. 

In  general,  unlike  stable  and  unstable  manifolds,  center  manifolds  are  not  unique, 
and  can  be  constructed  only  locally.  Indeed,  there  are  examples  which  show  that 
the  more  differentiable  one  requires  the  center  manifold  to  be,  the  smaller  may  be 
the  center  manifold  that  can  be  constructed  [1,  p  29,30].  Some  important  examples 
for  which  a  center  manifold  is  global  are  so-called  singularly  perturbed  equations. 
These  are  equations  of  the  form 

(1^)  edx/dt  =  f{x,y)  x  =  {xu...,xn) 


(iyfdt  =  g{x,y)  2/ =  (2/1, . . .  ,  t/.^) 

where  e  «  1.  Here  the  {xi}  are  the  “fast  variables”  and  the  {yi}  are  the  slow 
variables. 

Equation  (1)  can  be  rewritten  as 
{‘^^)  dx/dr  =  f{x^y) 


(2^)  dyjdr  =  e  g(x^  y) 

where  r  =  i/e.  In  this  time  scale,  in  the  limit  as  e  — 0  there  is  a  manifold  of  critical 
points  defined  by 

(3)  f{x,y)-^. 
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Suppose  for  some  region  of  (x,  y)  the  matrix  df/dx  has  eigenvalues  with  real  part 
bounded  away  from  zero.  In  such  a  region,  the  manifold  (3)  of  critical  points  is 
called  “normally  hyperbolic”  and  is  a  center  manifold  parameterized  by  y. 

For  €  >  0,  most  of  these  critical  points  disappear  generically.  However,  there  is 
still  a  connection  to  center  manifolds  as  follows.  One  appends  the  equation 

(2c)  de/dr  =  0 

to  (2),  Then  in  any  region  in  which  the  points  satisfying  (3)  are  normally  hyperbolic, 
(2a-c)  has  an  m-hl  dimensional  center  manifold  in  x,  y,  e  space;  this  center  manifold 
is  localized  around  /(x,t/)  =  0,  e  =  0.  It  can  be  shown  [2]  that  for  e  sufficiently 
small,  such  manifolds  can  be  chosen  globally,  parameterized  by  y.  The  center 
manifolds  of  (2)  are  often  called  “slow  manifolds.”  (Usage  varies;  sometimes  “slow 
manifold”  refers  to  the  e  ^  0  limit  of  the  center  manifold,  i.e.  /(x,  y)  =  0.) 

The  importance  of  the  slow  manifolds  is  that  they  can  be  used  to  construct 
(allegedly)  approximate  solutions  to  (1)  or  (2).  These  so-called  “singular  solutions 
are  unions  of  solutions  to  a  pair  of  simpler  equations,  the  “slow  equations”  and  the 
“fast  equations.”  The  slow  equations  are 

(4)  dy/dt  =  g{x,  y) 

where  x  =  x(y)  satisfies  /(x,y)  =  0.  The  fast  equations  are 

(5)  dx/dT  =  f{x,y) 

where  y  is  a  constant.  There  is  a  large  applied  math  literature  that  discusses 
“matching  conditions”  under  which  solutions  to  (4)  and  (5)  can  be  pieced  together 
to  get  an  object  that  satisfies  some  extra  conditions,  such  as  periodicity,  boundary 
conditions  at  finite  values  of  t,  or  limits  as  t  ^  d=oo.  (See,  e.g.  [3-5];  [3]  is  a 
good  introduction.)  The  singular  solution  is  the  zeroth  order  version  of  this  pieced- 
together  object.  Better  (formal)  approximations  can  be  calculated  from  (1)  and  (2) 
according  to  the  theory  of  “matched  asymptotic  expansions”  ([3-5]).  Especially  for 
the  e  —  0  version,  the  construction  of  the  singular  object  is  in  general  much  easier 
than  the  construction  of  the  actual  solution  to  (1)  or  (2). 

In  much  of  the  applied  math  literature,  there  is  a  blanket  assumption  that  the 
singular  object  is  indeed  an  approximation  to  some  actual  solution  to  (1)  or  (2),  with 
the  desired  extra  conditions.  However,  it  is  also  well-known  that  this  assumption 
is  occasionally  grievously  incorrect.  (See  papers  in  [6]  on  “resonance”  for  a  well- 
studied  example  and  a  geometric  treatment.)  This  uncertainty  has  given  rise  to 
another  large  literature  on  methods  for  figuring  out  circumstances  under  which  the 
existence  of  a  singular  solution  implies  that  there  is  an  actual  one  for  which  the 
singular  solution  is  an  approximation.  The  methods  invented  to  do  this  come  from 
dynamical  systems,  topology,  functional  analysis,  and  nonstandard  analysis.  See 
[7,8]  for  references  to  some  of  that  literature.  The  methods  I  find  most  conceptual 
and  most  powerful  are  the  geometric  ones,  involving  both  topological  techniques 
and  dynamical  systems  techniques.  Here  I  shall  deal  only  with  the  latter. 
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One  problem  that  has  been  much  analyzed  from  many  points  of  view  is  the 
existence  of  traveling  wave  solutions  to  nerve-conduction  equations,  a  class  of  par¬ 
abolic  PDE’s.  The  desired  solution  is  a  homoclinic  orbit  to  the  associated  ODE 
obtained  by  using  the  traveling  wave  ansatz.  The  singular  solution  associated  with 
this  problem  has  two  slow  pieces  and  two  fast  pieces.  A  description  of  the  singular 
solution  can  be  found  in  [7,9,10]. 

The  first  rigorous  geometric  treatment  was  given  by  C.  Carpenter  [9],  working 
under  the  direction  of  C.  Conley.  Her  proof  was  a  topological  one,  based  on  fixed 
point  theorems.  In  1980,  R.  Langer  [11]  produced  a  proof  for  a  simple  subset  of 
the  class  studied  by  Carpenter;  Langer ’s  proof  was  a  dynamical  systems  version 
in  which  the  homoclinic  orbit  is  constructed  as  the  intersection  of  the  stable  and 
unstable  manifolds  of  a  critical  point.  The  advantage  of  Langer’s  version  was  that 
it  proved  local  uniqueness,  and  gave  further  information  that  led  to  the  proof  by  C. 
Jones  of  the  stability  of  the  homoclinic  orbit  as  a  solution  to  the  original  PDE  [12]. 
The  disadvantage  was  that  it  seemed  very  complicated  and  not  easily  generalizable. 

The  essential  difficulty  in  Langer’s  proof  arises  in  a  very  large  class  of  problems 
involving  the  construction  of  solutions  to  singularly  perturbed  equations  by  dynam¬ 
ical  systems  methods.  In  many  cases  (e.g.  boundary  value  problems,  homoclinic 
orbits,  heteroclinic  orbits)  the  desired  trajectory  is  to  be  constructed  as  the  trans¬ 
verse  intersection  of  a  pair  of  invariant  manifolds. '  To  establish  the  transversality 
involves  tracking  the  position  and  tangent  planes  of  the  relevant  manifolds  over 
global  distances.  Since  the  desired  actual  solution  (if  it  exists)  is  near  the  singular 
solution,  it  must  pass  close  to  the  slow  manifolds  described  above.  This  means  that 
the  invariant  manifolds  whose  intersection  is  the  desired  solution  also  must  pass 
close  to  the  slow  manifolds.  (The  relevant  manifolds  in  the  case  of  a  homoclinic 
orbit  are  the  stable  and  unstable  manifold  of  the  critical  point  to  which  the  orbit  is 
homoclinic.)  The  actual  trajectory  in  general  takes  time  of  the  order  r  =  0(l/e)  to 
pass  by  the  slow  manifold,  so  careful  tracking  is  needed  to  keep  control  of  estimates. 

In  [13]  and  [14],  Jones  et  al.  resolved  this  difficulty  by  proving  the  “exchange 
lemma,”  a  tool  for  tracking  invariant  manifolds  as  they  pass  close  to  a  slow  man¬ 
ifold.  This  result  gives  estimates  on  the  position  of  the  invariant  manifold  after  a 
sojourn  of  time  >  0(l/e)  near  the  slow  manifold,  providing  that  some  transversal¬ 
ity  hypothesis  is  satisfied.  This  hypothesis  is  on  submanifolds  of  the  simpler  fast 
system  (5),  and  hence  is  generally  easier  to  verify  than  hypotheses  about  the  full 
system.  A  proof  of  the  existence  of  a  particular  orbit  for  a  particular  set  of  equations 
then  reduces  to  verifying  a  set  of  transversality  conditions  for  that  problem.  (For  a 
verification  in  the  case  of  FitzHugh-Nagumo  nerve  conduction  equations,  see  [10]). 
The  exchange  lemma  machinery  then  constructs  an  orbit  to  the  full  equations. 

The  work  in  [13]  and  [14]  requires  that  the  manifold  being  tracked  have  one 
higher  dimension  than  the  number  of  positive  eigenvalues  of  a  critical  point  on  the 
slow  manifold  (i.e.  of  the  matrix  dffdx).  This  constraint  is  satisfied  for  nerve- 
conduction  equations,  but  not  for  many  other  kinds  of  singularly  perturbed  equa¬ 
tions.  In  [15],  S.K.  Tin  generalized  [13]  and  [14]  by  removing  this  constraint.  With 
the  more  general  result  in  [15],  Tin  et  al.  [8]  were  able  to  prove  in  a  fairly  sim¬ 
ple  way  a  general  theorem  about  boundary  value  problems.  (See  also  [16]).  The 
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tracking  techniques  also  apply  to  some  perturbed  Hamiltonian  systems,  in  which 
the  relevant  manifolds  of  the  fast  system  are  not  transversal  at  e  =  0,  but  for  which 
Melnikov  methods  yield  a  weaker  form  of  transversality  [17].  Finally,  the  tracking 
techniques  are  beginning  to  be  applied  to  equations  other  than  singularly  perturbed 
systems  (e.g.  weakly  perturbed  Hamiltonian  systems)  [15]. 
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ample  of  a  perturbed  nonlinear  Schroedinger  equation.  This  technique, 
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1  Introduction 

In  this  article  we  will  describe  the  use  of  Melnikov  Integrals,  together  with 
methods  from  invariant  manifold  theory  and  geometric  singular  perturbation 
theory,  in  the  construction  of  global  solutions  to  partial  differential  equations 
(pde’s).  These  methods  have  been  developed  in  dynamical  systems  theory  of 
ordinary  differential  equations  [23],  [62],  and  have  only  recently  been  modified 
for  and  extended  to  the  infinite  dimensional  setting  of  pde’s.  In  fact,  the  use 
of  Melnikov  integrals  together  with  geometric  perturbation  theory  for  the 
rigorous  mathematical  study  of  pde’s  is  really  very  new,  with  most  of  the 
developments  having  occured  since  the  summer  school  at  the  Mathematical 
Sciences  Research  Institute.  Our  purpose  here  is  to  describe,  briefly,  these 
very  recent  developments  and  to  provide  a  short  list  of  relevant  literature. 

As  with  most  methods  as  they  are  being  developed,  this  one  begins  from 
explicit  examples.  Over  the  past  year  and  a  half,  several  rich  pde  examples 
have  been  investigated,  and  the  study  of  one  pde  has  been  completed.  It  is 
clear  that  these  examples  could  be  extended  and  developed  into  general  meth¬ 
ods  for  classes  of  pde’s,  a  generalization  which  will  certainly  take  place  over 
the  next  few  years.  Here  we  will  focus  on  that  pde  which  is  the  most  com¬ 
pletely  understood  [the  nonlinear  Schroedinger  (NLS)  equation],  and  mention 
the  other  examples  with  references. 

Melnikov  integrals  are  defined  for  perturbations  of  Hamiltonian  systems, 

=  JVH  +  6G(g),  (1.1) 

where  ^belongs  to  a  linear  inner  product  space  T  (the  “phase  space”),  the 
Hamiltonian  i/  is  a  real  valued  functional  on  the  perturbation  G  T  T ^ 
J  is  the  symplectic  matrix 


and  6  is  a  small  positive  parameter.  Let  h[t)^  t  G  (— oo,-foo),  denote  a  pe¬ 
riodic,  quasiperiodic,  or  homoclinic  orbit  of  the  unperturbed  (6  =  0)  Hamil¬ 
tonian  system.  Furthermore,  let  I  denote  a  real  valued  functional  on  .7^, 

I  : 

which  is  a  constant  of  the  motion  for  the  unperturbed  Hamiltonian  system, 
i.e,  which  “Poisson  commutes  with  the  Hamilitonian  //,” 

[gradH,  JgradI)  =  0. 

Here  (*,  •)  denotes  the  inner  product  on  the  phase  space  T. 
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Definition  1.1  The  Melnikov  Integral  (based  upon  I)  is  defined  as 

i  Lt  ^ 

where  the  unperturbed  orbit  h{t)  is  periodic  or  quasiperiodic.  In  the  case  that 
the  orbit  h{t)  is  homoclinic,  the  temporal  integral  is  replaced  by 

lim 

i->-oo 

for  suitable  sequences  of  real  numbers  {Tfi }  and  {T^},  tending  to  —oo  and 
+00  respectively. 

Remark  1.1  i)  The  Melnikov  integral  for  periodic  orbits  appears  in  the 
method  of  averaging  [54]  [23],  For  quasiperiodic  orbits,  this  definition  of 
the  Melnikov  function  may  be  found  in  [56], 

ii)  The  sequences  {T~ }  and  {T^}  are  used  [31]  to  define  conditionally 
the  Melnikov  integral  in  those  cases  where  the  improper  integral  itself  does 
not  converge.  In  our  own  work,  the  constant  of  the  motion  /  can  be  chosen  to 
guarantee  convergence,  and  the  extra  freedom  given  by  the  sequences  {T~ } 
and  {T^ }  has  not  been  needed. 

Melnikov  integrals  are  used  to  assess  the  fate  of  the  orbit  h  under  the 
perturbation,  (c  >  0,  but  small).  “Does  h  persist  (as  a  periodic,  quasiperi¬ 
odic,  or  homoclinic  orbit)  for  the  perturbed  system,  or  does  the  perturbation 
destroy  h  ?”  The  Melnikov  integral,  together  with  geometric  analysis,  can 
provide  answers  to  such  persistence  questions. 

As  is  clear  from  its  definition,  the  Melnikov  integral  M/  provides  an  esti¬ 
mate  of  the  change  in  the  value  of  the  constant  of  motion  I  over  the  perturbed 
orbit.  Without  an  additional  geometric  setting,  this  change  provides  very 
little  information  about  persistence.  Nevertheless,  for  specific  applications 
it  is  wise  to  understand  very  well  the  behavior  of  M/  as  a  function  of  the 
parameters  of  the  problem  before  developing  the  geometric  setting.  We  have 
learned  this  point  of  view  from  experience,  and  have  advocated  it  for  some 
time  [48],  [51].  For  example,  if  the  Melnikov  function  never  vanishes,  one 
can  often  quickly  infer  (even  with  mathematical  rigor  [56])  that  persistence 
is  impossible.  Once  the  parameter  dependence  of  the  Melnikov  function  M/ 
is  thoroughly  understood,  the  real  mathematical  challenge  remains  -  to  use 
Ml  together  with  geometric  analysis  to  establish  positive  results  about  per¬ 
sistence.  This  complete  goal  has  been  achieved  for  several  specific  pde’s,  as 
described  below. 

Melnikov  analysis  was  developed  [54],  [31],  [32]  for  ordinary  differential 
equations  where  it  has  been  a  very  successful  tool  as  is  well  documented  in 
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the  literature.  See,  for  example,  [23]  [62].  It  was  first  extended  to  pde’s 
by  Holmes  and  Marsden,  who  developed  a  general  setting  [30],  which  they 
then  applied  to  a  nonlinear  beam  equation  and  to  a  perturbed  sine-Gordon 
equation  [29] .  This  work  initiated  the  use  of  Melnikov  methods  in  the  context 
of  pdes,  where  it  proved  the  existence  of  solutions  with  extremely  irregular 
behavior  in  time.  However,  numerical  studies  [9]  of  perturbed  sine~Gordon 
equations  did  not  observe  this  particular  type  of  irregular  behavior.  Rather 
a  very  different  type  of  chaotic  phenomena  is  seen  numerically. 

Remark  1.2  Frequently  in  the  literature  the  expression  “Melnikov  analysis 
for  pde’s”  is  used  to  describe  the  following  situation:  The  original  pde  (in  one 
space  X  and  one  time  t)  is  reduced  to  a  finite  dimensional  ode  in  x  through 
a  traveling  wave  ansatz.  This  ode  is  then  analysed  with  Melnikov  methods 
for  ode’s.  In  this  article,  we  do  not  use  this  terminology,  and  we  restrict  the 
expression  “Melnikov  analysis  for  pde’s”  to  describe  analysis  of  the  original 
pde  as  an  infinite  dimensional  dynamical  system. 

2  Background 

Our  own  studies  of  chaotic  behavior  in  pde’s  began  with  the  numerical  inves¬ 
tigation  of  the  damped  and  driven  sine-Gordon  equation  [9], 

uti  -  Uxx  +  sin  u  =  e[-aut  +  F  cosa;^],  (2.1) 

with  even,  periodic  boundary  conditions, 

u{~-x^t)  =  u{x^t) 

u{x L^t)  =  u{x^t). 

This  equation  provides  a  very  natural  starting  point  for  the  study  of  chaotic 
behavior  in  pdes,  since  it  can  be  viewed  as  the  continuum  limit  of  coupled 
pendula,  and  since  the  damped  and  driven  pendulum  is  a  prototype  [5]  for 
chaotic  behavior  in  ode’s.  These  numerical  experiments,  as  well  as  those 
for  the  closely  related  damped  and  driven  nonlinear  Schroedinger  equation 
(NLS), 

-2iqr  +  qyy  +  (^^qq*  -ljq  =  i£  [aq  +  T]  ,  (2.2) 

q{-y,-r)  =  q{y,T) 
q{y  +  hT)  =  qiy,T)> 

are  described  in  detail  in  the  survey  [52],  a  preprint  of  which  was  made 
available  at  the  summer  school.  In  addition  to  a  description  of  the  numeri¬ 
cal  experiments,  that  survey  describes  the  unperturbed  integrable  theory,  it’s 
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connection  to  the  perturbed  numerical  experiments,  and  our  initial  steps  to¬ 
ward  a  Melnikov  analysis  for  natural  finite  dimensional  discretizations  of  the 
perturbed  pde.  Since  that  detailed  survey  is  now  available,  in  this  article  we 
will  restrict  ourselves  to  a  very  brief  summary  of  the  numerical  experiments 
and  focus  upon  very  recent  analytical  developments  for  the  pde  which  were 
completed  after  that  survey  was  written. 

A  broad  overview  of  the  numerical  experiments  is  as  follows:  In  both 
the  perturbed  sine-Gordon  and  NLS  cases,  for  certain  ranges  of  parameter 
values,  the  long  time  behavior  is  chaotic.  For  the  sine-Gordon  case,  this 
chaotic  behavior  occurs  at  small  amplitude  \u{x,  i)|  <<  1;  hence,  one  expects, 
for  the  NLS  equation,  very  similar  chaotic  oscillations,  which  indeed  are  also 
observed.  (In  fact,  movies  of  the  numerical  experiments  have  been  made 
which  display  such  chaotic  behavior  [5],  [36].)  In  both  the  NLS  and  sine 
Gordon  cases,  several  key  features  include  (See  Figure  2.1  and  Figure  2.2): 

1.  Temporal  chaos  is  observed  at  small  values  of  the  perturbation  param¬ 
eter  6; 

2.  Spatially,  the  wave  forms  are  very  regular  and  consist  of  coherent  local¬ 
ized  solitary  waves  which  oscillate  chaotically  in  time  t; 

3.  Both  dynamical  systems  diagnostics  (Poincare  Sections,  Lyapanov  and 
information  dimensions)  and  the  spectral  transform  of  the  unperturbed 
integral  theory  indicate  that  the  “dimension  of  the  attractor”  can  be 
very  small.  In  the  simplest  case,  the  wave  consists  in  one  solitary  wave 
interacting  with  a  long  wavelength  background; 

4.  Spectral  transform  measurements  of  the  unperturbed  integrable  theory 
also  establish  that  the  fundamental  instability  which  causes  the  chaotic 
behavior  is  the  modulational  instability  of  this  long  wavelength  back¬ 
ground  (or  mean),  a  classical  instability  well  known  in  the  theory  of 
nonlinear  waves  [61]; 

5.  In  this  simplest  case,  the  lone  solitary  wave  jumps,  irregularly  in  time 
t,  between  a  location  at  the  center  {x  =  0)  and  location  at  the  edge 
{x  =  L/2),  the  only  two  spatial  locations  allowed  by  even  periodic 
boundary  conditions. 

This  last  point  makes  feasible  the  existence  of  a  “symbol  dynamics”  on  the  two 
symbols,  [C  (=center)  and  E  (=edge)].  Mathematically,  one  anticipates  the 
possibility  that  an  invariant  set  exists  on  which  the  dynamics  is  topologically 
equivalent  to  a  Bernoulli  shift  on  two  symbols;  that  is,  on  which  the  dynamics 
is  as  random  as  a  sequence  of  coin  tosses. 

In  any  event,  the  observed  chaotic  behavior  consists  in  the  interaction  of 
coherent  localized  spatial  structures  with  each  other  and  with  a  long  wave¬ 
length  mean.  Moreover,  the  modulational  instability  of  this  long  wavelength 
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Figure  2.1:  Unperterbed  Solitary  Wave 


Figure  2.2:  Instabilities  induced  by  increasing  F 

mean  is  central  to  the  chaotic  behavior.  Neither  of  these  crucial  characteristic 
features  is  present  in  the  initial  existence  theory  in  [30].  Further  analysis  is 
required. 

3  A  Perturbed  NLS  Equation 

We  study  a  perturbed  nonlinear  Schroedinger  equation  (PNLS)  of  the  form 

iqt  =  +  +  it[Dq-l],  (3.1) 

where  the  constant  G  (2}  1))  c  is  a  small  positive  constant,  and  Z?  is  a 
bounded  negative  definite  linear  operator  on  the  Sobolev  space  of  even, 
27r  periodic  functions  which  are  square  integrable  with  square  integrable  first 
derivative.  Specific  examples  of  the  dissipation  operator  D  include  the  discrete 
Laplacian  and  a  “smoothed  Laplacian”  given  by 


Dq  =  —aq  —  jdBq, 


(3.2) 
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where  the  operator  B  has  symbol  given  by 

un\  \  k  <  K, 

=  t>«. 

This  pde  is  well  posed  in  [60].  In  fact,  the  solution 

qit-,e)  =  F*{qin) 

has  several  derivatives  in  and  in  the  parameters  such  as  c,  with  the  exact 
number  of  derivatives  increasing  with  decreasing  e. 

Our  analysis  of  this  equation  begins  with  two  observations:  Firsts  when 
e  =  0,  the  unperturbed  NLS  equation  is  a  completely  integrable  soliton  equa¬ 
tion.  Second^  the  ‘‘plane  of  constants”  lie, 

He  \da:q{x,t)  =  0}, 

is  an  invariant  plane  for  PNLS.  In  each  of  these  two  cases,  [c  =  0  or  g  G  lie], 
the  behavior  of  solutions  can  be  described  completely.  In  the  first 

case,  this  description  is  accomplished  through  the  spectral  transform  of  com¬ 
pletely  integrable  “soliton  mathematics” ;  in  the  second  case,  it  is  accomplished 
through  “phase  plane  analysis”.  In  the  jargon  of  the  theory  of  dynamical  sys¬ 
tems,  our  methods  will  be  a  form  of  “local-global”  analysis,  where  at  times 
the  term  “local”  will  mean  close  to  the  plane  lie,  and  at  other  times  “local” 
will  mean  close  to  the  integrable  case.  In  any  event,  throughout  our  global 
arguments,  control  is  achieved  either  because  of  proximity  to  (i)  the  plane  He 
or  (ii)  6  =  0. 

3.1  Motion  on  the  Invariant  Plane 

On  the  invariant  plane  fl^,  the  equation  takes  the  form 

iqt  =  2[qq-u^]g  -  ie[aq+\],  (3.3) 

where  it  is  assumed  that  the  dissipation  operator  D  acts  invar iantly  on  lie  as 

bq  =  -aq, 

for  a  a  positive  constant.  Equivalently,  in  terms  of  polar  coordinates 

q  y/l  exp 

these  equations  take  the  form 

It  =  -2€[a/-f\/7  cos  0] 

Ot  =  — 2(7  —  cj^)  H — ^sin^. 

v7 


(3.4) 
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Figure  3.1:  Phase  Plane  Diagram  of  the  ODE 

When  €  =  0,  the  unperturbed  orbits  on  FIc  are  nested  circles,  with  Sut  u, 
circle  of  fixed  point  given  by  /  =  For  e  >  0,  the  perturbed  orbits  on  lie 
are  very  different  (see  Figure  3.1).  First,  only  three  fixed  points  exist:  O, 
which  is  a  deformation  of  the  origin;  Q,  a  saddle  which  deforms  from  the  circle 
Su;  and  P,  a  spiral  sink  which  also  deforms  from  the  circle  S^-  Formulas  for 
the  sink  P  and  the  saddle  Q,  together  with  their  associated  growth  rates,  are 
given  by 

Oj,  =  -tan-i  +  (3.5) 

^  rv//) 


UJ 


2lo 


tan  ^ - 7r  +  0(€) 

auj 


(3.6) 


(7p  =  ±2i^/Tu[l  -  ~  6a  +  0(€2) 

<7,  -  ±2^/^[l-aV]^  -  ea  +  0(6l).  (3.7) 

While  the  circle  of  fixed  points  for  the  unperturbed  {e  =  0)  problem 
does  not  persist  as  a  circle  of  fixed  points,  motion  near  Su;  remains  slow  for 
small  positive  e.  Introducing  the  variable  J 

J  =  7  - 


equation  (3.4)  is  written  as 
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Figure  3.2:  Phase  Plane  Diagram  of  the  ODE  in  the  j  0  coordinates 


Jt  =  — 2€  [a{J  +  +  \/j  +  cos  9] 

Ot  =  -2J-\--=l=smO.  (3.8) 

In  order  to  describe  the  slow  flow  close  to  this  circle  5u;,  we  rescale  the 
coordinates 


T  —  ut 

J  =  (3.9) 

where  v  =  y/l.  This  rescaling  is  suggested  by  the  expressions  (3.7)  for  the 
growth  rates  and  Uq.  In  these  scaled  coordinates,  the  equations  (3.4)  on 
the  plane  lie  f^^ke  the  form  of  an  0(z/)  perturbation  of  the  conservative  system 

jr  =  — 2(aa;^ +c<;cos^) 

Or  =  -2j  . 

Thus,  we  see  that  near  the  circle  the  slow  motion  is  approximated  as  a 
driven  pendulum,  (see  Figure  3.2)  with  energy 

^(i)  (sin  ^  .  (3. 10) 

This  completes  the  information  about  motion  on  the  invariant  plane  that 
we  will  use  in  this  general  overview.  More  information  is  needed  for  the 
detailed  mathematical  arguments.  This  additional  information  may  be  found 
in  [37],  [40], [52], [53], [44]. 

3.2  Two  Results  from  the  Integrable  Theory  of  NLS 

The  unperturbed  (£  =  0)  NLS  equation  is  a  Hamiltonian  system  on  the  func¬ 
tion  space  Hip, 

-tqt  =  jz  H, 

Sq 


(3.11) 
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with  the  Hamiltonian  H  given  by 


It  is  well  known  that  this  is  a  completely  integrable  Hamiltonian  system,  a 
fact  whose  verification  begins  from  the  Lax  pair 

V’x  =  (3.12) 

ifit  =  (3-13) 

where 

(0  2iXq  +  qx 

. 

-\-2iXq  -  qx  0 

and  where  denotes  the  third  Pauli  matrix  cr^  :=  diag{l,-l).  This  over 
determined  system  is  compatible  {dt^x  —  T  if  coefficient 

q  satisfies  the  NLS  equation.  Consequently,  one  can  use  this  linear  system  to 
develop  representations  of  solutions  q{x^t)  of  NLS. 

Focusing  attention  upon  the  ‘^spatial  flow”  (3.12),  the  integration  of  the 
NLS  equation  is  accomplished  through  the  spectral  theory  of  the  differential 
operator  L  =  L(g), 

--  d 

L  :=  - 

dx 

which  is  viewed  as  an  operator  on  L^(IR),  with  dense  domain  H^.  “Soliton 
mathematics”  provides  a  complete  description  of  the  solutions  of  this  unper¬ 
turbed  integrable  system;  however,  from  that  complete  theory,  we  will  only 
need  two  results: 

1.  Expressions  for  homoclinic  orbits; 

2.  A  natural  constant  of  motion  F,  together  with  an  expression  for  the 
Melnikov  function  Mp. 

Remark  3.1  It  is  natural  to  inquire  the  degree  to  which  integrable  results 
are  required  to  carry  out  “local-global”  analysis  for  pde’s.  In  fact,  no  in¬ 
tegrable  information  is  required,  provided  one  has  (i)  an  expression  for  the 
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unperturbed  homoclinic  orbit  and  (ii)  a  constant  of  the  motion  I  which  is 
related  to  the  geometry  of  the  pde  in  a  manner  that  allows  both  the  definition 
and  geometric  use  of  a  Melnikov  function  M/.  However,  integrable  theory 
provides  not  one,  but  a  rich  class  of  homoclinic  orbits.  Moreover,  as  will  be 
shown  below,  the  constant  of  motion  F  from  integrable  theory  is  so  natural 
from  a  geometric  perspective  that,  when  available,  it  should  definitely  be  used 
to  define  the  Melnikov  function.  We  take  the  viewpoint  that  integrable  theory 
sets  the  standard  toward  which  other  more  general  theories  of  unperturbed 
systems  can  strive. 


3.2.1  Integrable  Homoclinic  Orbits 

Fix  a  solution  of  the  NLS  equation,  q{x^  t)  G  for  which  z/  is  a  periodic  (or 
antiperiodic)  eigenvalue  of  the  operator  L(g),  which  is  purely  imaginary  and 
has  geometric  multiplicity  2.  As  described  in  [42],  this  eigenvalue  could  be 
associated  with  a  linear  instability  of  q{x^  t)^  and  we  assume  this  is  indeed  the 
case.  As  described  in  detail  in  the  surveys  [52],  [49],  [50]  and  in  the  original 
articles  [16]  and  [42],  Backlund  (Darboux)  transformations  at  (g,  p)  can  be 
used  to  construct  orbits  homoclinic  to  (a  torus  translate)  of  q(x^t):  Denote 
two  linearly  independent  solutions  of  the  Lax  pair  at  (g,  A  =  z/)  by  ^~), 
and  the  general  solution  of  the  linear  system  at  (^,  v)  by 

<^(x,^;z/;c+,c_)  =  ,  (3.14) 

We  use  (j)  to  define  a  transformation  matrix  [58]  G  by 


G  =  GiX;ua):=Nl^-/  ^  «  J  iV 


-1 


(3.15) 


where 


Then  we  define  Q  and  ^  by 


N  := 


<t>l  -^2 

L  (l>2  01 


(3.16) 


Q{x^t)  :=  q[x^t)  +  2(v  -  p) 


0102 


0101  +  0202 


(3.17) 


and 

^((r,^;A)  :=  G(A;  z/;  0)  0(a;,  A),  (3.18) 

where  0  solves  the  Lax  pair  at  {q,X),  Formulas  (3.17)  and  (3.18)  are  the 
Backlund  transformations  for  the  potential  and  eigenfunctions,  respectively. 
We  have  the  following 
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Theorem  3.1  (i)  Q{x^t)  is  an  solution  of  NLS,  with  spatial  period  27r , 

which  is  homoclinic  to  q[x^t)  in  the  sense  that  Q[x^t)  >  q^j^^x^t), 
exponentially  as  exp(— crjy|t|)  as  t  — y  dioo.  Here  qo^  is  a  torus  trans¬ 
late”  ofq.cTj,  is  the  nonvanishing  growth  rate  associated  to  the  complex 
double  point  v,  and  explicit  formulas  can  be  developed  for  this  growth 
rate  and  for  the  translation  parameters  0± . 

(ii)  "^{x.t;  A)  solves  the  Lax  pair  at  (Q,  A). 

Remark  3.2  This  theorem  is  quite  general,  constructing  homoclinic  solu¬ 
tions  from  a  wide  class  of  starting  solutions  q{x^t).  The  orbit  Q{x^t)  is 
homoclinic  to  a  torus  which  itself  possesses  rather  complicated  spatial  and 
temporal  structure,  and  is  not  just  a  fixed  point;  nevertheless,  the  homoclinic 
orbit  typically  has  still  more  complicated  spatial  structure  than  its  “target 
torus”.  These  Backlund  formulas  provide  coordinates  for  the  stable  and  un¬ 
stable  manifolds  of  the  critical  tori;  thus,  they  provide  explicit  representations 
of  the  critical  level  sets  which  consist  in  “whiskered  tori”  [4]. 

For  this  work  a  very  important  special  example  is  the  case  q{x,  t)  con¬ 
stant,  independent  of  x: 

q{x,t)  =  cexp{-z[2(c^  -  u;’^)t  -  7]}. 

A  standard  stability  calculation  shows  that  q{x,  t)  linearly  unstable,  with  pos¬ 
itive  growth  rate  cr  for  the  linearized  “cos  x  mode”  given  by 

(J  =  a/4c2  -  1. 

In  this  case,  two  linearly  independent  solutions  of  the  Lax  pair  are  given  by 


=  exp{±i[K(A)(a; -h  2A^)]}  X 


cexp{-z[2(c^  -  -  7]/2}  \  /o  iQ'v 

(±ac(A)  -  A)  exp{i[2(c2  -  Lj'^)t  -  j]/2}  j  ’ 


where  k{X)  is  given  by 


k(A)  =  c^. 

From  these  eigenfunctions,  one  sees  that 

p  =  -\/4c^  —  1 
2 

is  a  purely  imaginary  double  (  antiperiodic)  eigenvalue,  which  is  indeed  as¬ 
sociated  to  an  instability.  Hence,  the  Darboux  transformation  will  produce  a 
homoclinic  orbit: 

f  cos2p-  zsin2ptanhr  ±sinp  sech  rcosa:)  ^  Uq  pq) 

~  S  1  =p  sin  p  sech  r  cos  x  J  ’ 
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where 


r  =  (T{t-{-to) 

2c 

Here  ±  labels  a  symmetric  pair  of  homoclinic  orbits.  (Notice  that  —  cos  a;  = 
cos(a:  +  7r),  which  shows  that  one  sign  (+)  represents  an  excitation  centered  at 
a:  =  0,  while  the  other  sign  (—)  represents  an  excitation  centered  at  a;  =  tt,) 
If  we  specialize  to  c  =  oj,  q  lies  on  the  circle  of  fixed  points  5^^,  and 
the  orbit  Q  is  homoclinic  to  this  circle.  Thus,  from  one  point  of  view,  (3.20) 
provides  an  explicit  representation  of  a  “whiskered  circle”;  while  from  another 
viewpoint,  it  provides  an  explicit  representation  of  the  unstable  manifold 

W^S^)  =  W^{S^)=  U  qt  . 

7,io,± 


When  examining  the  “whiskers”  for  this  example,  one  notes  that  while 
Q  is  homoclinic  to  the  circle  5c^,  it  is  actually  heteroclinic  to  pairs  of  fixed 
points  on  with  the  individual  fixed  points  separated  by  a  “phase  shift” 


of  ~*4p, 


(3.21) 


(Here  c  =  lj,  although  this  formula  for  the  phase  shift  is  valid  more  generally 
and  actually  applies  for  an  open  set  of  values  for  c.)  This  phase  shift  will 
play  an  important  role  in  the  singular  behavior  of  the  persistent  homoclinic 
orbits,  as  well  as  in  the  construction  of  these  persistent  orbits  as  singular 
deformations  of  integrable  orbits  (3.20). 


3.2.2  The  Constant  F 

In  order  to  introduce  a  constant  of  the  motion  which  geometrically  is  the  most 
natural  for  constructing  a  Melnikov  integral,  we  must  describe  some  spectral 
theory  of  the  operator  L.  This  material  is  described  in  detail  in  [42],  [52]. 

Since  the  coefficient  g  is  a  periodic  function  of  x,  Floquet  theory  can  be 
used  to  characterize  this  spectrum.  Floquet  theory  begins  from  the  funda¬ 
mental  matrix  M  =  M{x;  A;  g),  which  is  defined  as  the  2x2  matrix  valued 
solution  of  the  linear  problem 

L't!)  =  A'^, 

whose  initial  value  at  a;  =  0  is  the  identity  matrix.  Next,  one  introduces  the 
transfer  matrix  T 


T(A;9)  =  M(27r;A;9). 
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Then  the  spectrum  a{L)  can  be  characterized  as  the  set  of  all  A  for  which  the 
2x2  matrix  T  has  eigenvalues  on  the  unit  circle.  Since  the  detT  =  1,  this  is  in 
turn  determined  by  a  single  scalar  function  called  the  Floquet  discriminant: 

A  :  C  X  //i,p  ^  C  %  A(A;  9)  :=  tr[T(A;  g)]. 

In  terms  of  A,  the  spectrum  is  then  given  in  terms  of  the  Floquet  discriminant 

by 

a(Z{q))  =  {A  €  C  I  A(A,  q)  is  real  and  -  2  <  A  <  +2}. 

The  Floquet  discriminant  A(A;  q,  q), 

A  ;  C  X  Hl  j,  X  Hl  j,  C, 

is  entire  in  A,  q,  and  q.  Its  first  variation  admits  the  following  representation: 
5A(A;  q,  q)  =  Sqix)  +  ^  <5g(x)J  dx, 

where 

^A(A;g,g)  =  -ifr  M-^x)  (  °  J)m(x  +  2^)^ 
j^A{\-,q,q)  =  M-^(x)  J  J  ^  M(x  +  27r)  .  (3.22) 

Here  M(x)  =  M(x;  A'",  q,  q)  denotes  the  fundamental  matrix.  To  verify  this 
representation,  one  [52]  proceeds  as  follows: 

(L-X)M  =  0,  M{0)  =  I 

{L-X)5M  =  ^M(0)  =  0. 

One  then  solves  for  SM{x)  by  “variation  of  parameters” ,  which  together  with 
the  definition, 

SA  =  tr  (^M(27r), 

produces  the  representation. 

The  Floquet  discriminant  generates  an  infinite  family  of  constants  of  the 
motion  for  the  NLS  system,  as  the  following  proposition  states: 

Proposition  3.1  (i)  Floquet  discriminants  Poisson  commute: 

{A(A;g,g),  A(A';^,g)}  =  0  VA,  A', 

where  the  Poisson  bracket  is  defined  as 

/•2^  /6FSG  6F6G\ 


dx. 
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(ii)  A{X;q^q)  is  a  constant  of  the  motion  for  the  NLS  equation  since  its 
Poisson  bracket  with  the  Hamiltonian  H  vanishes: 

{A{X),H}  =  Q  VA. 

Thus,  A(A;0  generates  an  infinite  family  of  NLS  constants  of  motion,  one 
for  each  A. 


The  proof  of  this  proposition,  may  be  found  in  the  survey  [52]. 

We  will  define  the  invariant  F  in  terms  of  the  Floquet  discriminant  A  (A;  ^ : 
Fix  a  potential  qo  G  Hl^  which  has  a  purely  imaginary  critical  point  A^, 

=  0. 

Ac 

Let  Nh  =  Nh{qo)  denote  a  small  neighborhood  of  q^,  and  consider  the  critical 
point  as  a  functional  on  this  neighborhood,  A*^  =  A^(g): 

-  0;  A^(^,)  =  A^ 

Definition  of  the  Invariant  F:  In  terms  of  this  purely  imaginary  critical 
point,  we  introduce  an  important  invariant: 

F  :  TVfe  M  given  by  F  A(A^(g);  g).  (3.23) 


^A(A;g) 


^^A(A;9o) 


The  functional  F  is  smooth,  provided  ^  A  (A,  5)  ^  0  for  all  q  &  Ni,.  To 
verify  this  smoothness,  one  calculates 


SF 


X=X-{q) 


provided  A^(^)  is  differentiable.  But  A^(g)  is  smooth,  as  the  following  calcu¬ 
lation  shows: 

A'(A%);g)  =  0 
A"(Al9);9)f +  ^  =  0. 

If  A"(A^(g);  q)  ^  0,  one  can  continue  to  differentiate  to  any  order. 

Critical  Points  of  F:  The  critical  points  of  the  functional  F  will  play 
important  roles.  To  understand  these,  it  will  be  useful  to  develop  a  formula 
for  the  gradient  F'{q)  in  terms  of  Bloch  functions. 


Remark  3.3  The  specific  eigenfunctions  (3.19)  used  in  the  example  q{x) 
c,  independent  of  x,  were  Bloch  functions. 
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Let  A)  denote  Bloch  functions,  that  is  eigenfunctions  of  L  at  [q,  A] 

which  are  defined  (up  to  normalization)  by  the  transfer  condition  across  one 
period: 

if{x +  2tt,X)  -  p{X)'<p{x,X).  (3.24) 

Here  p{X)  denotes  the  Floquet  multiplier,  which  is  related  to  the  Floquet 
discriminant  by 

p(A)  =  i[A(A)  +  v'A2(A)-4].  (3.25) 

The  functions  p  and  -if  are  well  defined  on  the  Rienmnn  surface  for 
(A,  a/A2(A)  -  4) ,  and  {x,  A)  denote  the  values  of  ^  on  the  two  sheets  over  A. 
At  branch  points  {simple  periodic  or  antiperiodic  points),  the  two  sheets  touch 
and  become  linearly  dependent.  (This  is  compatible  with  the  fact  that 
at  a  simple  eigenvalue,  the  eigenspace  is  one  dimensional.)  At  real  multiple 
points,  remain  linearly  independent,  while  at  complex  multiple  points  they 
may,  but  need  not,  become  dependent.  These  two  possibilities  at  the  complex 
multiple  points  are  a  key  to  this  nonselfadjoint  spectral  problem. 

In  any  case,  for  fixed  A,  these  Bloch  eigenfunctions  can  be  represented 
explicitly  in  terms  of  the  columns  of  the  fundamental  matrix  M{x-,X)  = 
column{Y^^\x-,  A),  y^^^(a:;  A)}: 

if^{x-,  X)  =  a*{M2i(l;  A)y(')(a;;  A)  +  [M22(1;  A)  -/)=^(A)]y(2)(a:;  A)},  (3.26) 

where  a±  denotes  normalization  constants. 

The  gradient  of  F  admits  a  beautiful  representation  in  terms  of  these 
Bloch  functions,  which  follows  from  equation  (3.22): 


Proposition  3.2 


grad  F(q,q)  =  i 


.x/A(A<=)'^-4 


i’t {x-,  X'=)'tp2  {x-,X‘ 


_y.+(x;A=)V>r(*;A^) 


(3.27) 


From  this  representation,  one  obtains  the  following 

Theorem  3.2  The  potential  q  is  a  critical  point  of  the  functional  F  if  and 
only  ifX^{q)  is  a  multiple  point  with  geometric  multiplicity  2. 


Remark  3.4  We  note  that  if  A'^  were  a  real  multiple  point,  it’s  geometric 
multiplicity  is  always  2.  Thus,  a  potential  q  for  which  A'’(g)  is  a  real  double 
point  is  a  critical  point  of  F.  However,  since  X''{q)  is  a  complex  double  point, 
the  geometric  multiplicity  may  be  either  2  or  1,  and  q  may  or  may  not  be  a 
critical  point  of  F. 


MELNIKOV  ANALYSIS  FOR  PDE’s 


67 


Critical  Tori:  To  understand  the  critical  points  of  F,  we  first  note  that, 
generically,  the  NLS  level  sets  are  tori  =  SxSxS  ■  -  of  infinite  dimension, 
with  the  radius  of  the  circle  measured  by  rj  =  |A(Ap:f  2|.  For  for  which 
Xj{q^)  is  a  multiple  point  (A(Ap  =  ±2),  rj  =  0  and  thej^^  circle  is  “pinched 
off” .  In  this  case,  the  function  q^  resides  on  a  torus  which  is  singular  in  the 
sense  that  it  has  one  dimension  less  than  maximal  because  Vj  =  0.  Consider 
the  singular  torus  associated  with  r  =  A(A^)  q=2  =  0.  Any  q^  on  this  singular 
torus  is  a  critical  point  of  F, 

Whiskered  Tori  and  Unstable  Manifolds:  The  existence  of  these 
critical  tori  is  guaranteed  by  inverse  spectral  theory  which  can  be  used  to 
construct  some  of  them  in  terms  of  finite  genus  theta  functions.  In  reference 
[42]  we  have  further  studied  these  critical  tori,  by  calculating  an  explicit 
representation  of  the  Hessian  of  F  at  g*.  If  the  multiple  point  A^(g*)  were 
real,  the  Hessian  shows  that  F  is  either  a  maximum  or  a  minimum  at  q:^,. 
However,  since  A^(g^)  is  a  purely  imaginary  double  point,  F  has  a  saddle 
structure  at  g*.  In  the  latter  case,  if  F  were  the  NLS  Hamiltonian,  q^  would 
be  an  unstable  fixed  point;  however,  F  is  only  in  involution  with  the  NLS 
Hamiltonian  H.  As  such,  q^  lies  on  a  singular  torus  which  could  be  unstable 
(hyperbolic)  under  NLS  dynamics.  If  the  complex  double  point  =  z/ 

is  indeed  associated  to  an  NLS  instability,  the  Lax  pair  for  the  NLS  flow  can 
be  used  in  a  Backlund  transformation  q^  qh^  to  provide  a  representation 
of  the  whiskers  for  the  critical  tori  on  which  q^  resides.  In  this  manner  one 
learns  the  very  important  fact  that 

W^^q^)=W^iq.)  =  {q\Fiq)  =  ±2}, 

with  the  ±  sign  depending  upon  whether  X^{q^)  is  a  periodic  or  antiperiodic 
eigenvalue. 

Remark  3.5  To  understand  these  singular  tori  more  completely,  one  [42] 
must  study  the  entire  sequence  of  invariants 

Fj{q)  =  A{Xj(q);q)  Vj  =  •  •  • , -2, -1,  0, 1,  2,  •  •  •  . 

However,  for  our  purposes  here,  we  don’t  need  this  degree  of  generality. 

4  The  Melnikov  Function  Mp 

In  this  section  we  construct  the  Melnikov  function.  First,  we  obtain  a  useful 
formula  for  the  gradient  F',  evaluated  on  a  whisker  qh. 

4.1  F'(9,) 

Let  q^  lie  on  a  critical  unstable  torus  with  the  instability  associated  with  the 
purely  imaginary  double  point  z/,  and  with  whisker  q^  represented  by  the 
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Backlund  formulas  (3.17).  An  explicit  representation  of  the  gradient  F'{qh) 
can  also  be  constructed  with  Backlund  formulas  (3.17,  3.18). 

We  begin  from  equation  (3.27)  for  the  grad  F, 


6F  _  .  -  4  /  ^  \ 


(4.1) 


where  A)  are  a  Floquet  basis  at  {Qh,  In  [42]  we  compute  this  limit 

using  the  Backlund  formulas.  The  result  is 


6F  ^  \ 


(4.2) 


where  the  constant  is  given  by 

C^:^i{iy-u)  Ja{u)A"{u) 


Remark  4.1  Since  $  =  c+^I+I  +  one  sees  explicitly  from  this  for¬ 

mula  that  ^1(5,,,.)  ^  0  as  c+/c-  -S-  0,oo.  Also,  since  the  eigenfunctions 

-ifM  and  at  the  complex  double  point  u  grow  or  decay  exponentially, 

«  exp  (±(Tj^^),  i  — >  00, 

the  formula  also  shows  explicitly  that  grad  -)■  0  as  t  oo.  (The 

vector  field  grad  F  must  vanish  because,  in  these  limits,  the  point  qh  on  the 
whisker  tends  to  a  critical  function  of  F.) 

In  the  case  of  9.  =  c  exp{-*  [2(0^  -  u^)t  -  7]}  =  c  ,  explicit  formulas 
can  be  used  to  produce  the  following  representation  of  F'  {qh),  which  is  valid 
when  acting  upon  even  functions  of  x: 


where  T  =  <j{t-  to),  tanp  =  a,  and  a  =  -  1.  From  this  representation, 

we  see  explicitly  that 


F'{qh{t))  +  0> 

lim  F'{qh{t))  -t  0) 

<[->•±00 


at  an  exponential  [exp(-(T|t|)]  rate  of  approach.  Thus,  9*  is  indeed  critical, 
while  the  whisker  qh{t)  is  not. 
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4.2  The  Melnikov  Integral 


In  this  subsection  we  use  these  Backlund  formulas  for  gradF  to  develop  an 
explicit  representation  of  the  Melnikov  integral.  First,  we  write  the  perturbed 
NLS  equation  in  the  form 

qt  =  iH\q)  eG{q)  (4.4) 

where  H^{q)  =  -q^x-2[qq~u)‘^]  q  and  where  G(^)  denotes  the  perturbation. 
Usually  we  will  consider  a  particular  example  of  this  perturbation: 

G{q)  =  -aq-  (4.5) 

where  the  operator  B  has  a  symbol  given  by: 

It: 

Definition  4.1  In  this  pde  setting,  the  Melnikov  function  is  defined  by 


/oo 

{F'[qh{t)],G[qh{t)])dt. 

-OO 


Next  we  specialize  the  orbit  qh{t)  to  one  homoclinic  to  a  circle  of  fixed 
points:  Recall  that,  on  the  plane  of  constants  11^  the  unperturbed  orbits  take 
the  form 

thus,  n=uj  represents  the  circle  of  fixed  points  Orbits  homoclinic  to  S^j 
are  denoted  qh  =  q^j  and  are  given  by 

rcos2p  —  isin2u  tanhr -l-sinusech  rcosx' 

q^{t)  =  - ^ - : - - -  X 

L  1  —  sin  p  sech  r  cos  a: 

a;expi[(96 -2p]  (4.7) 

where 


tan  p  =  \/ 4a;2  _  i 
T  =  (tanp)(t +  ^o). 

While  the  orbit  q^  approaches  the  circle  duS  t  ±(X),  it  approaches  the 
fixed  point  ivexpiOi  as  t  ^  —oo  and,  as  i  +(xd, 

q^{t)  u 

Thus,  the  (heteroclinic)  orbit  experiences  a  phase  shift  of  — 4p 


e-4ip  = 


I  —  i  v/4a;2  —  1 


With  these  ingredients,  we  assemble  the  final  expression  for  the  Melnikov 
integral: 
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Proposition  4.1  For  the  specific  perturbation  (4-5)  and  homoclinic  orbit 
equation  (4.7),  the  Melnikov  integral  takes  the  form 

/oo 

G{qu{t)))  dt  (4.8) 

-CXD 

=  [o^Ma  +  +  M (^fe)]  5 


where 


/oo 

{F'{qUt)),gUt))dt 

-OO 

/oo  ^ 

{F'{qu,{t)),  Bq^{t))dt 

-oo 

/oo 

{F'{q^{t)),  l)dt  . 

-oo 


More  explicitly, 


roo  /•27r  47t  SIR^  Po  sech  T 

=  /  dr  dx - - X 

J-oo  Jo 

[sech  T  +  sin  po  tanh^  r  cos  x  -  sin^  Po  sech  r  (2  +  cos^  x ) 
-\-2sm^  Po  sech  cos  x] 

roo  r27r  47r sin"^  Po  sech  T 

=  /  dr  dx - -  X 

J-oo  Jo 

j^sin Po  sech  r  cos  x  —  sin^  Po  sech  ( 1  +  sin^  x)  x 
[2  sech  r  -  sin  po  sech  cos  x  -  2  sin^  Po  sech  r 
+2sin^Po  sech  ^rcosa;]  +0(sin^“^  Po) 


POO  r2'K  47ra;sin^Po  sech  r 
M{ei)  =  cos(<96  -  2p,)  dr  dx - - x 

[  sech  r  ~  sin  po  cos  x] 

where  po  =  tan“^  -  1,  ^  =  [1  -  sinpo  sech  r  cos  x]  and  where  the 
0(sin^~^  po)  term  in  the  hdg  equation  is  due  to  the  fact  that  we  used 
instead  of  B  in  our  computation.  Thus,  the  final  expression  for  the  Melnikov 
integral  is  of  the  form 

M{oi,  fi,  0}))  —  oiMa  +  +  Ml  cos{0i,  —  2po),  (4-9) 

where  Ma.Mp,  and  Mi  are  functions  of  uj  only. 

Clearly,  for  small  a  and  f3,  this  Melnikov  function  has  simple  zeros  as  a 
function  of  6h.  At  issue,  of  course,  is  the  geometric  meaning  of  these  zeros. 
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5  Persistent  Homoclinic  Orbits  for  PNLS 

Simple  zeros  of  the  Melnikov  function  (4.9)  enable  us  to  prove  the  following 
persistence  theorem: 

Theorem  5.1  The  perturbed  NLS  equation  (3.1)  possesses  a  symmetric  pair 
of  orbits  which  are  homoclinic  to  the  saddle  fixed  point  Q,  provided  the  param¬ 
eters  lie  on  a  codimension  1  set  in  parameter  space  which  is  approximately 
described  by 

a  ~  E{u)/3. 

Remark  5.1  Calculations  in  the  proof  of  this  theorem  provide  formulas  which 
describe,  approximately  for  small  e,  characteristic  properties  of  these  homo¬ 
clinic  orbits  such  as  the  constant  E{(jj)  and  an  important  Hake-off”  point. 

These  two  homoclinic  orbits  differ  by  the  location  of  a  transient  spatial 
structure  -  a  solitary  wave  which  is  located  either  at  the  center  {x  =  0)  or 
the  edge  {x  -  tt)  of  the  periodic  box.  As  such,  this  theorem  provides  a  key 
necessary  step  toward  a  symbol  dynamics  for  the  pde. 

The  proof  of  this  theorem  is  organized  with  “local-global”  analysis.  It 
involves  normal  forms  for  the  perturbed  NLS  equation  [47],  invariant  manifold 
theory  for  NLS  and  geometric  singular  perturbation  theory,  combined  with 
integrable  theory  and  Melnikov  analysis.  It  is  described  in  mathematical 
detail  in  [44].  Here  we  outline  the  steps  in  the  proof,  emphasizing  the  intuition 
behind  each  step. 

The  steps  in  the  proof  are  organized  in  the  subsections  which  follow: 

1.  Preliminary  set  up  including  (i)  motion  on  H^  (ii)  coordinates  near 
He,  (iii)  linear  stability  and  time  scales,  and  (iv)  a  normal  form. 

2.  Local  arguments  including  (i)  persistent  invariant  manifolds,  (ii)  fiber 
representations,  and  (iii)  the  height  of  W^{Q). 

3.  Global  arguments  including  (i)  integrable  theory,  (ii)  the  first  Mel¬ 
nikov  measurement,  and  (iii)  the  second  Melnikov  measurement. 

When  working  through  this  overview  of  the  proof,  keep  in  mind  that,  through¬ 
out  the  proof,  control  is  obtained  in  one  of  two  ways  -  -  either  the  orbits  are 
(i)  close  to  the  invariant  plane  H^,  or  they  are  (ii)  close  to  the  integrable  case. 
Also  keep  in  mind  that  the  arguments  will  be  a  form  of  “shooting” ,  where  the 
goal  will  be  to  force  an  orbit  to  “hit”  target  manifolds  of  high  dimension,  but 
in  an  infinite  dimensional  space.  To  make  these  manifolds  easy  targets,  we 
make  them  very  large  in  the  sense  that  they  will  be  codimension  1.  In  any 
case,  the  overview  of  the  proof  follows: 
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5.1  Preliminary  Set  Up 

There  are  four  preliminary  steps  before  the  proof  really  begins.  These  are 
(i)  a  study  of  motion  on  the  invariant  plane  lie:  (ii)  the  definition  of  local 
coordinates  near  Re;  (iii)  the  use  of  these  local  coordinates  for  a  linear  sta¬ 
bility  analysis  which  identifies  two  time  scales  which  result  from  a  gap  in  the 
linearized  spectrum;  and  (iv)  the  development  of  a  near  identity  transforma¬ 
tion  which  places  the  local  coordinates  in  a  “normal  form”.  Each  of  these 
preliminary  “set-up  steps”,  while  tedious,  are  essential  to  the  proof. 

5.1.1  Motion  on  Re 

Motion  on  the  plane  has  already  been  described  in  Section  3.1,  and  is  sum¬ 
marized  by  the  figures  3.1  and  3.2.  In  particular,  we  remind  the  reader  of 
the  two  fixed  points  near  the  unit  circle  -  the  saddle  Q  and  the  sink  P,  with 
their  growth  rates  on  the  plane  R^  given  by  (3.7).  As  described  in  Section 
3.1,  when  restricted  to  a  “resonance  band”  near  the  circle  S^,  the  motion  will 
be  slow.  In  particular,  in  the  resonance  band,  control  is  achieved  by  the  en¬ 
ergy  of  the  driven  pendulum,  equation  (3.10).  In  particular,  with  that  energy 
we  can  obtain  and  control  global  representations  of  the  stable  and  unstable 
manifolds,  restricted  to  the  plane  Rc,  of  the  saddle  point  Q  -  W  {Q)  and 

5.1.2  Coordinates  Near  Rc 

The  next  preliminary  step  is  to  introduce  coordinates  which  will  be  useful 
locally  near  the  plane  Rc,  or  more  specifically,  near  the  circle  5u,.  This  entails 
introducing  coordinates  (J,  0,  /),  where  6  is  the  angle  on  S^,  J  is  a  measure 
of  distance  from  on  the  plane  Rc,  and  /  is  in  the  orthogonal  compliment 
of  Rc.  These  coordinate  changes, 

are  arrived  at  in  the  following  manner: 

First,  we  introduce  coordinates  {p,  0,  f)  given  by 

q  ■-\p{t)  + f{x,t)]expi9{t),  (5.1) 

where  p  and  6  are  polar  coordinates  on  the  plane  Rc,  and  /  €  R^,  i.e.  /  has 
spatial  mean  zero. 

The  norm  is  a  constant  of  motion  for  the  unperturbed  {e  =  0)  flow; 
therefore  the  perturbed  equations  can  be  somewhat  simplified  by  replacing 
the  variable  p  with  the  variable  /  : 

=  T  r[p^  +  ff]dx  =p^  +  {ff).  (5.2) 

27r  Jo  Jo 
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Finally  since  we  are  working  in  a  neighborhood  of  the  circle  of  fixed 
points  which  corresponds  to  /  =  it  will  be  convenient  to  introduce 
the  variable  J  defined  by 

J  =  I  -  (5.3) 

In  terms  of  these  variables,  the  perturbed  NLS  equation  takes  the  form 

Jt  —  ~2£  [(x^J  +  J  ui^  cos  9]  +  (if,  9,  f]  e) 

9t  =  -2J +  e{J  +  uY'^^^  sm9  +  £2iJ,9,f;e)  (5.4) 

+  L/  +  ^Qs{f)  +  £z{j,9,  f]t)^ 

where 

Lcf  =  fxx  +  ^  ^  Df  4"  2uj‘^  (/  +  /) 

vj  =  2J(/+/)  +  £^^=/, 

VJ  4* 

and  where  Sk  are  27r  -periodic  functions  in  0^  of  order 

£i(J,9,f-,e)  =  Oief) 

£2{J,9J-,€)  =  0{f) 

£3{J,9,f-,e)  =  0{jf  +  f) 

for  small  J  and  /. 

5.1.3  Linear  Stability  and  Time  Scales 

The  linearization  of  equations  (5.4)  is  a  perturbation  of  the  linear  system 

=  0 

Ot  =  -2J  (5.5) 

ift  =  LJ. 

This  is  a  constant  coefficient  linear  system  whose  growth  rates  are  easy  to 
compute.  First,  it  is  convenient  to  introduce  real  coordinates: 

/  =  +  :=  u, 

in  terms  of  which  the  linear  system  takes  the  form 

Jt  ^  0 

Ot  =  -2J  (5.6) 

Ut  =  Lf  U . 
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Thus,  we  must  consider  the  eigenvalue  problem 


Lfe  =  X 


for  the  eigen-pairs  {e{x),  A}.  Using  Fourier  expansions,  one  finds  for  j  -  1: 


where 


1  T 

=  cosx 

20ra; 

=  |±(T-0(e)|, 


-  1. 


For  i  >  2,  the  eigenvalues  come  in  complex  conjugate  pairs,  with  negative 
real  part: 

Xj  =  iQj-0{e), 


where 


-  =jJf  -  4w2  >  0 


Thus,  one  identifies  a  very  important  gap  in  the  spectrum.  In  the  “cos  a: 
direction  in  function  space,  the  growth  and  decay  rates  are  while  all 

other  growth  and  contraction  is  slow.  Specifically,  on  the  plane  lie,  ths  ex¬ 
pansion  and  contraction  are  0{y/e).  All  other  modes  {cosia:,j  >  2}  decay 
at  the  slow  0(e)  rate.  This  gap  in  the  growth  rates  will  play  a  central  role 
throughout  the  analysis. 

In  terms  of  the  eigen-basis,  the  mean  zero  function  u  may  be  written  as 


u{x)  =w„eu(s)  -I-  VsCsix)  -b  Vo{x) 


(5.10) 


where  and  are  real  scalars,  and  where  Uo(*)  €  [span{Wc,eu,es)\^-  In 
terms  of  these  variables  the  linear  equations  (5.6)  split  into 

=  0 

9,  =  -2J 


Thus,  we  explicitly  see  that,  for  £  —  0  the  linear  equations  have  one  unstable 
(cos a:)  direction  e^,  one  stable  (cos a:)  direction  e^,  and  an  infinite  number 
of  center  directions  (J  ,9  ,Vo).  Combining  these  center  variables  as  - 
(J,  9,  U(,)^,  the  linear  equations  can  be  written  as 
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where  A  is  defined  from  equations  (5.11), 

In  a  5  neighborhood  of  the  circle  of  fixed  points  Scjj  the  nonlinear  equation 
can  be  viewed  as  a  perturbation  of  the  linear  equation  (5.12).  Under  the  flow 
of  this  linear  equation,  and  for  c  =  0,  5^^  has  one  dimensional  stable  and 
unstable  manifolds,  together  with  a  codimension  2  center  manifold.  We  focus 
our  attention  on  the  center  manifold  E^(Su,),  together  with  the  center-stable 
and  center-unstable  manifolds: 

E^'^iSu,)  =  span{€u}'^ 

=  span{es}'^ 

=  span{eu ,  es}-^ 

We  will  be  interested  in  the  persistence  of  these  manifolds,  under  both  the 
nonlinearity  and  the  perturbations. 

5.1.4  A  Normal  Form  of  the  Equations 

The  stable  manifold  of  the  saddle  Q,  W^{Q)^  is  a  codimension  2  manifold  for 
which  we  will  need  global  control.  On  the  plane  flc,  this  control  is  achieved 
with  the  pendulum  energy  (3.10).  Off  the  plane,  we  need  to  estimate  its 
“height”  in  order  to  complete  the  last  step  in  the  argument  call  the  “Second 
Measurement” . 

In  order  to  illustrate  the  issue,  consider  the  following  model  problem: 

ft  =  -if-€f+\f\f. 

The  origin  is  a  stable  fixed  point  for  this  equation.  How  large  is  it’s  stable 
manifold,  ly^(O)  ?  A  quick  calculation  shows  that 

/  =  cexp  (—it) 

is  a  nearby  periodic  solution,  and  that  W^(0)  cannot  be  any  larger  than  0(e). 
Next,  consider  the  similar  model  problem 

ft  =  -if-ef  +  f, 

which  is  also  quadratically  nonlinear.  The  former  example,  together  with  the 
following  scaling  calculation,  both  indicate  that  if  |/|  >  0(e),  the  nonlinear 
term  might  effect  the  decay: 


=  ef 

=  -if-<f  +  P)- 


f 

ft 
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On  the  other  hand,  if  the  nonlinearity  was  of  higher  power,  the  effect  would 
be  diminished: 

9t  =  -ig-if  +  f< 

9  = 

gt  =  +  V^)- 

One  sees  that,  for  cubic  nonlinearities,  the  decay  will  not  be  effected  for 
fi  >  1/2.  That  is,  g  can  be  0(e2+^)  without  effecting  the  decay. 

For  a  quadratic  nonlinearity,  one  is  simply  unsure  of  its  effect  without 
more  detailed  information  about  its  specific  nature.  If  it  really  doesn’t  effect 
the  decay,  we  should  be  able  to  transform  it  away.  Such  a  transformation 
approach  is  the  philosophy  behind  ^^normal  forms  .  In  the  second  example, 
we  introduce  the  “near  identity  transformation” 

f  =  g  +  cf- 


If  we  choose  c  =  -(i  +  €)'^  the  equation  gets  mapped  to 

gt  -  -ig-  eg  +  0{g^), 

and  we  see  that  \g\  =  0{€2+^)  without  effecting  the  decay. 

In  this  example,  the  transformation  is  allowed  because,  in  the  definition 
of  the  constant  c, 

1 

C  —  ■  r  ^ 

{i  +  e) 

the  denominator  never  vanishes  for  small  e.  In  more  general  examples,  van¬ 
ishing  or  small  denominators  can  provide  obstacles  to  the  existence  of  the 
transformation. 

In  our  application  to  the  perturbed  NLS  equation,  we  return  to  equation 
(5.4)  and  focus  upon  the  “/  equation”, 

ift  —  Lcf  4-  Vtf  +  ^Qsif)  4"  ^3(‘^5  /?  ^) • 

Here  we  consider  the  possibility  of  transforming  away  the  quadratic  nonlin¬ 
earity  Qsif)  with  a  near  identity  transformation  of  the  form 

g  =  f  +  K{f,f)  ^  __  (5-13) 

K{f,h)  :=  Ku{f,h)  +  Kii{f,h)  +  I<h{f,h)  + Kii{f,h)  , 
where  K  are  bounded  bilinear  maps  K  ;  11^  X  fl,.  I-  fl^ 


KnU,  h) 

KiiiLh) 
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with  similar  expressions  for  and  /fjj.  In  terms  of  Fourier  expansions 
these  bilinear  maps  can  be  written  as 

Ku{f,h)  =  kii{k,e)fik)h{e)e'^’^+‘^^ 

k+i^O 

Knif,h)  =  E  kn{k,i)f{k)l{-i)e^^>^+^^\ 

k-\-£^0 

In  this  pde  setting,  vanishing  denominators  which  would  be  obstructions 
to  the  transformation  take  the  form  of  “resonances”  for  the  linear  equation 

idtf  =  fxx  +  2^;^  (/  +  /); 

that  is,  simultaneous  zeros  of  the  pair  of  equations 

ki-\-k2  =  k^ 
ih  r22  =  ±  ^3  , 

where  the  dispersion  relation  Clj  =  Q{kj)  is  given  by  equation  (5.9).  However, 
some  algebra  shows  that,  in  the  space  H^  (  where  A;  /  0),  these  equations 
have  no  solutions;  that  is,  there  are  “no  quadratic  resonances”. 

Thus,  a  near  identity  transformation  exists  which  transforms  away  the 
quadratic  nonlinearity.  In  fact,  this  transformation  can  be  found  explicitly: 

R\i{k,e) 

kiiik,^) 

k-nik,e) 

krxik,e) 

Note  that  A:  ^  0,  ^  0,  and  i  k  ^  0  since  we  are  in  the  space  H^. 

Moreover  since 

Y\kab{e,k)\‘^<<X>, 

we  have  K  G  {S^  x  5^),  and  this  implies  that  K  is  a  bounded  bilinear 
map  on  H^ 

for  all  /  G  n^.  Finally  we  can  invert  the  equation 

9  =  f^KUJ) 
for  /  in  a  neighborhood  of  the  zero  to  obtain 


OJ 

M 


UJ 


^(A:  +  £) 

(jj 

k{k-^e) 


=  0  . 
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where  K,  is  of  order  0{g^). 

Applying  this  near  identity  map  places  the  full  equations  in  their  final 
form.  The  transformation 

g  =  f  +  K{f,f). 

will  eliminate  the  Qz  term,  but  it  will  introduce  new  quadratic  terms  in  the 
equation  for  g  that  have  £  coefficients  such  as 

eDKifJ), 

due  to  the  presence  of  terms  like  £  B  /  in  the  /  equation.  Therefore,  using 
( J ,  0 ,  p)  as  coordinates,  the  equations  near  5^  are  written  as 


Jt  =  -2€[a{J  +  up)  +  \/j  +0)2  cos^]  +  Ni{J,0,g;e) 

0^  ^  -2J  +  €{J  +  ujY'P  sine  +  N2{J,0,g-,e)  (5.14) 

igt  =  L^g  +  Wcg  +  N3{J,e,g;e) . 

where,  for  gr  in  a  neighborhood  of  zero,  we  have 

NiiJ,e,g;e)  =  0{eg^) 

N2iJ,e,g-,e)  =  0{g^) 

N3{J,e,g-,e)  =  OiJg^  +  eg^  +  g^) 

Finally,  since  we  will  be  working  with  invariant  real  manifolds  in  a  neigh¬ 
borhood  of  the  circle  of  fixed  points  S^,,  it  will  be  convenient  to  introduce  a 
real  coordinate  system: 

u  =  {Re{g),Im{g))'^. 

In  terms  of  these  variables  the  above  equation  takes  its  final  form: 


Jt  =  -2e[a(J +  uj'^)  +  \/T+^  cosB]  +  Ni{J,e,u;e) 

Bt  =  -2J^e{J+uP)-^l^s\ne  +  N2{J,e,u-,e)  (5.15) 

ut  =  L(U-\-VfU-\-  N3(J,e,u]e). 


Here  Nz  is  interpreted  as  a  two  vector , 

L,  =  Jdl  -  Auj'^S  +  eb 


Ve  =  —  4  J  <5  +  € 


sin  9 


^/TV^ 


J. 


where 
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5.2  Local  Arguments  (Near  He) 

In  this  section  we  describe  the  local  parts  of  the  argument.  We  will  be  inter¬ 
ested  in  the  fate  of  the  manifolds 


=  span{eu}'^ 

=  span{es}^ 

E''{S<^)  =  span{eu , 

under  nonlinearity  and  perturbation.  It  will  turn  out  that  these  manifolds 
persist  and  become  invariant  manifolds  for  a  “cut-off  flow” : 

E^{S^)  ^  W^  :=M, 

From  the  linear  spaces,  it  is  clear  that  these  persistant  manifolds  will  be  cod 
1,  cod  1,  and  cod  2,  respectively. 

In  order  to  establish  the  existence  of  these  local  invariant  manifolds,  as 
well  as  to  coordinatize  them  in  a  manner  useful  for  singular  perturbation 
calculations,  we  begin  by  introducing  a  “cut-off  flow”:  We  start  by  fixing 
S>0,  and  introducing  a  localization  function 

where  y?  is  and  satisfies 


V’(s)  = 


1,  ki  <  1  \ 

0,  |5|  >  2  /  • 


Then  we  localize  the  full  equations 


Jt  —  — 26[a(J  +  u)^)  y/  J  -\-  cos  T* 

0t  =  -2J  +  e[{J  +  w2)-i/2  sin  e]^s{J)  +  NiiJ,  6,  u;  e)i)s{J,  u) 
ut  =  LcU-\-[V(,u-\- N^{J,9,u]e)]i)s{J,u),  (5.16) 


where 


ips{J,u)  =  '<1>s{\\u\\hi)  ■ 

Note  that  we  do  not  cut-off  the  variable  9]  thus,  the  function  /)  has 
the  effect  of  cutting  off  the  right  hand  sides  whenever  the  phase  point  lies 
outside  a  neighborhood  Us  of  the  circle  S^.  Because  of  this  localization,  the 
right-hand  side  of  equation  (5,16)  has  a  global  Lipshitz  constant  of  order 
0(6  +  5) . 
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This  localization  has  the  effect  of  keeping  the  flow  unchanged  in  a  5- 
neighborhood  of  5a,,  while  changing  the  nonlinear  equations  (5.15)  to  be- 
globally  a  ^-perturbation  of  a  linear  constant  coefficient  system.  Using 
^  y^'jT  3^g  variables  and  the  operator  A  defined  in  equation  (5.12), 

we  can  write  equations  (5.16)  as 

Vu,t  = 

Vs,t  —  -<^1  (w; «)  (5-17) 

Vc,t  =  Avc  +  Ri{v;e), 

where  R^{v ;  e)  and  its  first  derivatives  are  of  order  0((i  +  e)  . 


5.2.1  Persistent  Invariant  Manifolds 

We  will  show  that  the  localized  equation  (5.17)  has  {k  >  3)  invariant  man¬ 
ifolds  that  deform  smoothly  from  and  E^.  In  turn,  for  the  original 

equations,  these  manifolds  will  be  locally  invariant  in  a  (J  neighborhood  of 
5a;. 

First  we  define  local  invariance: 


Definition  5.1  Given  an  open  set  O,  a  manifold  M  is  called  ^'locally  in¬ 
variant  (in  0)''  under  a  flow  if  for  every  open  interval  1  such  that 
F^{q)  C  O,  F^*{q)  €  M  for  one  t^  £  I  F^{q)  e  M^t  e  F 


Then  we  state  the  persistence  theorem: 

Theorem  5.2  There  exist  a  6  neighborhood  Us  of  S^,  an  eo{S)  >  0,  and  an 
integer  k  such  that^e  e  [0,€o),  equation  (5.15)  has  a  locally  invariant  (in 
U$ )  manifold  of  codimension  1, 

'^u  “  hy^(vgj  v^y  ^) 5  (5.18) 


W. 


cs 


where  the  function  is  C'^  in  all  of  its  arguments,  and  2n-periodic  in  9. 
moreover  for  e  —  0,  intersects  tangentially  along 
Similarly,  we  have  a  locally  invariant  manifold  given  by 


wr  = 


(veW 


Vs  —  hs  {Vu  j 


(5.19) 


where  the  function  hs  is  O’"  in  all  of  its  arguments,  and  2w-periodic  in  9. 
moreover  for  e  —  Oj  intersects  E^^  tangentially  along  3,^,. 

The  existence  of  a  codimension  2  “slow  manifold”  Me  is  then  given  by 
the  following: 
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Corollary  5.1  Let  Me  denote  the  intersection 

Me  =  f] 

Then  Me  is  a  locally  invariant  (in  Us)  manifold  of  codimension  2, 

Me  =  Vu  =  hi  [vc]  e),  Vs  =  hi  €)|  (5.20) 

where  the  functions  hl  ^  are  in  their  arguments^  and  27: -periodic  in  9. 
Moreover  for  €  =  0,  intersects  tangentially  along 

There  are  essentially  two  methods  to  prove  such  a  persistence  theorem  - 
the  “graph  transform  method”  of  Hadamard  [24]  and  the  “integral  equation 
approach”  of  Perron  [57].  The  graph  transform  approach  for  the  NLS  pde  is 
developed  in  [45].  (This  approach  is  developed  in  generality  for  dissipative 
pde’s  in  [7];  the  integral  equation  approach  is  developed  for  pde’s  in  [6].) 
Here,  since  our  NLS  equation  may  be  viewed  as  a  perturbation  of  a  constant 
coefficient  linear  equation,  we  prefer  the  integral  equation  approach.  While 
it  is  less  geometric,  it  is  more  explicit  and  more  concrete  than  the  graph 
transform  method. 

Remark  5.2  In  this  near  conservative  setting,  the  manifolds  M  and  We 
are  naturally  infinite  dimensional.  (In  addition,  their  small  codimension  is 
important  for  our  “shooting  ”  arguments.)  These  manifolds  are  not  compact; 
hence,  dissipative  methods  such  as  those  in  [7]  do  not  apply. 

First  we  rewrite  equation  (5.17)  in  integral  form: 

rt 

Vu{t)  =  exp[(7l{t-t^)]v^{t^)+  exp  [<(<-s)]jR^(v(s)  ;£)</« 

Jtu 

Vs{t)  =  exp[-<T'(t  -  +  /  exp[-cr‘(i-s)]/?f(v(s);£)<is 

Jt, 

ft 

Vc{t)  =  exp[Ai]uc(0)  4-  /  exp  [A(t  -  5)]  (i;(5);  e)  (5.21) 

Jo 

Most  solutions  of  these  equations  will  grow  as  exper^^  as  i  --)•  +00.  How¬ 
ever,  solutions  in  the  “center  stable  manifold”  cannot  grow  this  rapidly. 
In  fact,  the  gap  in  the  growth  rates  allow  one  to  characterize  the  invariant 
manifold  by 

W‘^  =  lveW  sup  f  exp  —t  ||F*(u;£)||jy  )  <  oo| ,  (5.22) 

(  t>o\  L^oJ  /  J 

where  F^(u;  e)  is  the  cut-off  flow  of  equations  (5.17)  and  Hq  is  a  fixed  positive 
integer.  Thus,  for  v  G  we  have 

exp[-cr^  tu]  \vu  (4)1  — >  0 


as  4  — y  -|-oo. 
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Therefore  for  solutions  on  the  integral  equation  can  be  written  as 

Vuit)  =  /  exp[<(i-s)] 

J+oo 

Vs{t)  =  exp[-«Tji]ws  +  ^  exp  [-cT^(i  -  s)]  i?f(w(s);  e)  ds 

Vc{t)  =  exp[At]vc  +  exp[A{t-s)]Ri{v{s)-,e)ds,  (5.23) 

The  persistence  theorem  is  then  established  with  Newton  iterations,  using 
the  space-time  norm 

||u||a  =  supexp{[-(Tt/A]||t)(t)||//i} . 
t>0 

Once  convergence  has  been  established,  the  function  which  represents  the 
manifold  as  a  graph, 

is  given  by 

rO  * 

■yc;  e)  =  =  0)  =  /  exp[<(t-s)]fl„(u(s);€)ds. 

Remark  5.3  The  smoothness  part  of  the  argument  is  slightly  more  delicate, 
for  an  interesting  reason.  While  solutions  in  cannot  grow  as  expcr^t  as 
t  -)-oo,  they  can  grow  slowly  (compared  with  expcr^t).  This  slow  growth, 
together  with  a  quadratic  nonlinearity,  causes  the  estimates  of  the  iterates 
associated  to  derivatives  to  degenerate.  Fortunately,  the  gap  in  the  spectrum 
still  allows  these  terms  to  be  controlled.  In  fact,  the  size  of  the  gap  determines 
how  many  derivatives  the  manifolds  possess.  (See  [44]  for  details.) 

5.2.2  Fiber  Representations  of  the  Normally  Hyperbolic  Invariant 
Manifolds 

This  gap  in  the  spectrum  also  indicates  the  presence  of  two  distinct  time 
scales  in  the  problem  -  motion  on  the  plane  Ylc  experiences  slow  expansion 
and  contraction  on  the  vt  time  scale,  where  p  -  i/e,  while  motion  off  the 
plane  expands  and  contracts  on  the  fast  scale  t.  These  two  distinct  scales 
make  the  long  time  behavior  singular,  and  necessitate  representations  of  the 
invariant  manifolds  appropriate  for  singular  perturbation  calculations.  Such 
representations  were  introduced  by  Fenichel  [17],  [18],  [19],  [20]  for  finite 
dimensional  dynamical  systems.  In  this  finite  dimensional  setting,  recent 
uses  of  these  “fiber  representations”  include  [37],  [40],  [38],  [27],  [39],  [34], 
[33].  Surveys  of  the  finite  dimensional  situation  are  [35]  and  [63].  There  is 
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Hbeis 


Unperturbed  Perturbed 

Figure  5.1:  Orbits  and  Fibers 


not  much  literature  concerning  these  representations  for  pde’s.  Some  earlier 
work  includes  [12],  [41],  [44],  [7],  [45]. 

For  intuition  about  these  Fenichel  Fibers,  consider  the  following  trivial 
example  with  two  time  scales: 

vt  =  -y 

Xt  =  -v{x-\-y), 

where  0  <  z/  <<  1.  Clearly,  the  orbits  behave  very  differently  when  u  =  0 
than  when  z/  >  0.  (See  Figure  5.1.)  For  example,  when  u  —  0^  the  x 
axis  is  a  line  of  fixed  points,  while  for  z/  >  0,  the  origin  is  the  only  fixed 
point.  However,  in  this  latter  case  {u  >  0)  the  x  axis  remains  an  invariant 
manifold  on  which  the  motion  experiences  slow  contraction.  Clearly,  the 
orbits  behave  singularly  as  z^'  — >  0.  Nevertheless,  the  center  stable  manifold 
behaves  smoothly  with  z/,  but  its  representation  as  a  union  over  orbits  would 
not  display  this  smoothness.  We  seek  an  alternative  representation  in  terms 
of  fibers. 

The  “stable  fiber  through  the  base  point  m  on  the  slow  manifold  Ad” 
is  defined  as  that  equivalence  class  of  points  z  (initial  conditions)  with  the 
property  that  the  trajectories  initialized  at  z  rapidly  approach  trajectories 
initialized  at  m.  For  our  example,  this  means  that,  for  m  G  Ad, 

II  F^{z;  u)  —  F^(m;  z/)  ||  =  0[exp  {—t)]  as  t  +oo|  .  (5.24) 

In  this  trivial  example,  one  can  calculate  explicitly:  Set  m  —  {xrm  0)  and 
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^  =  (x^y)  and  obtain 

u)  =  ^exp  {-ut)  [x  -  y]  +  0[exp  (-f)],  exp  (-t)  y 
F*(TO;t')  =  (exp(-i/t)a;„,0); 

from  which  one  sees  that  the  fiber 


Clearly,  in  contrast  to  orbits,  these  fibers  are  smooth  with  i/. 

This  example  can  also  be  used  to  describe  an  alternative  view  of  the  fibers. 
One  seeks  a  coordinate  transformation  which  diagonalizes  the  slow  flow  in  the 
sense  that  the  equations  for  the  slow  variables  do  not  depend  upon  upon  the 
fast  variables.  In  the  context  of  this  simple  example,  one  changes  variables 

{x,y)  {w,y) 

so  that,  in  terms  of  the  new  variables,  the  equations  take  the  form 

yt  =  -y 

Wt  =  —I'W. 

The  transformation  which  accomplishes  this  is 

x  =  f{w,y),  f{w,0)  =  w, 

where  ^ 

f{w,y)  =  w  + 

In  the  new  coordinates  (lu,  y),  called  '‘fiber  coordinates’",  the  fibers  have  been 
“straightened  out” , 

=  {(W,3/)|W  =  0}. 

In  the  original  coordinates  {x,  y)  the  transformation  function  provides  a  rep¬ 
resentation  of  the  fiber.  Indeed,  fix  a  base  point  m  =  (w,0).  Then  the  fiber 
through  m  is  given  by 

»  =  /(w,y)  =  W  + 

Once  a  flow  is  partially  diagonalized,  it  is  clear  that  “points  on  a  fiber 
suffer  the  same  fast  fate  as  that  fiber’s  base  point  on  the  slow  manifold” ;  that 
is  (for  this  example), 

F\Tm)  =  F*{m)  +  0[exp  (-t)],  t  -l-oo 
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In  the  literature,  the  prevalent  view  of  fibers  is  the  geometric  one  of 
an  equivalence  class  of  points,  primarily  because  of  the  pioneering  work  of 
Fenichel.  Here  the  perturbed  NLS  equation  can  be  viewed  as  a  perturbation 
of  a  constant  coefficient  linear  problem.  Because  of  this  simplifying  feature, 
we  prefer  the  alternative  view  as  a  partial  diagonalization  of  the  flow  which 
decouples  the  slow  dynamics.  In  this  NLS  setting  [44],  this  view  allows  us  to 
construct  the  fibers  with  an  integral  equation  framework  which  is  very  similar 
to  the  Perron  method  that  we  used  for  persistence.  (The  original  “equiva¬ 
lence  class”  point  of  view  is  implemented  for  perturbed  NLS  equations  in 
[41],  [45].) 

With  integral  equation  methods,  one  [44]  can  establish  the  following  fi- 
bration  theorem: 

Theorem  5.3  For  all  €  G  [0,  €o]  ihe  manifold  admits  ,  through  a 
transformation f  coordinates 

ric  e 

Vu  e  [-7/0,  Vo] 

such  that  the  submanifold  M.^  corresponds  to  r/  =  0,  and  the  flow  on 
decouples  in  the  following  manner: 

flu  =  K  +  r^(T/„,j?c;£)]»? 

f]c  =  Aric  +  Sliricie) 

where  P^,  and  their  first  derivatives  are  of  order  0{e  -h  (^).  The  trans¬ 
formation  itself  is  also  with  respect  to  parameters  such  as  e,  A  similar 
statement  holds  for  , 

The  proof  of  this  theorem,  which  proceeds  by  integral  equation  methods, 
may  be  found  in  [44].  It  involves  comparing  solution  trajectories  through  an 
arbitrary  point  with  trajectories  through  a  base  point  on  the  slow  manifold. 
This  comparison  requires  enough  preliminary  notation  that  the  argument, 
while  straight  foreward,  is  somewhat  tedious.  As  in  the  argument  for  persis¬ 
tence,  the  smoothness  part  is  more  delicate  than  existence. 

Remark  5.4  One  reason  for  the  loss  of  derivative  is  due  to  the  simple  iter¬ 
ation  procedure  that  we  use  to  show  the  persistence  and  the  fibration  of  the 
manifold.  We  use  a  simple  Newton’s  iteration  procedure  applied  to  functions 
defined  on  noncompact  domain. 

These  fibers  allow  general  motion  to  be  tracked  by  following  only  motion 
through  the  base  point  on  the  slow  manifold.  In  addition,  they  are  nicely 
behaved  with  respect  to  the  parameter  €,  as  well  as  with  respect  to  the  base 
point.  As  such,  they  are  very  useful  in  singular  perturbation  calculations. 
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5.2.3  Height  of 

The  final  local  information  concerns  the  “height  of  the  stable  manifold  W^{Qy\ 
On  the  plane  11^  this  manifold  is  controlled,  globally,  with  the  energy  of  the 
driven  pendulum  (3.10).  Here  we  discuss  an  estimate  of  its  height  over  the 
plane. 

The  near  identity  or  normal  form  transformation  discussed  earlier  in  Sec¬ 
tion  5.1.4  enables  us  [44]  to  use  integral  equation  methods  to  establish  the 
following 

Theorem  5.4  The  point  Q  has  a  local  stable  manifold  in  Me  which  can  be 
parametrized  by  (^,  Vo); 

w  =  {(J,  e,  vo)\J  =  f{o,  Vo)} 

for  all  ||vo||f/i  €  [Oj 

Remark  5.5  Note  that,  in  this  theorem,  one  is  restricted  to  lie  within  the 
slow  manifold  Ait  , 

W  =  W%Q)nMt. 

The  theorem  states  that  the  “wall”  W  is  tall  enough,  to  permit  a 

successful  “shooting  argument”  in  the  second  measurement. 

5.3  Global  Arguments 

With  this  material  about  the  local  behavior  near  the  plane  Uc  in  hand,  we  can 
turn  to  the  global  issues  from  integrable  theory,  and  to  the  first  and  second 
measurements. 

5.3.1  Integrable  Theory 

The  global  material  from  integrable  theory  has  already  been  discussed  in 
Section  3.2.  It  consists  in  representations  of  homoclinic  orbits,  a  natural 
constant  of  the  motion  F,  and  a  Melnikov  function  Mf  which  is  based  upon 
F.  Here  we  restrict  ourselves  to  a  discussion  of  the  reason  the  invariant  F  is 
so  natural  an  invariant  upon  which  to  base  the  Melnikov  function. 

Remark  5.6  1.  For  integrable  cases,  the  explicit  representations  of  the  ho¬ 
moclinic  orbits  provide  beautiful  and  explicit  representations  of  the  Fenichel 
fibers  [41],  [43],  [45]. 

2.  As  discussed  earlier,  F  is  critical  at  unstable  tori;  hence,  F'  will  vanish 
exponentially  as  the  whisker  approaches  its  unstable  torus.  This  provides 
exponential  convergence  of  the  Melnikov  integrals  and  is  one  reason  to  base 
the  Melnikov  function  on  F.  However,  this  convergence,  which  itself  is  due  to 
geometric  reasons,  is  not  the  main  structural  reason  that  F  is  natural. 
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Recall  that,  for  the  integrable  NLS  equation,  the  center-stable  manifold  of 
the  circle  of  fixed  points  admits  the  following  global  characterization: 

WS^{S^)  =  {qeHljF  =  -2}. 

Thus,  gradF  is  normal  to  fTQ^(5u;).  As  this  manifold  is  cod  1,  the  distance 
of  a  general  point  from  it  can  be  assessed  with  one  measurement  in  the 
gradF  direction.  Under  perturbation,  the  manifold  Wq^{Su,)  persists  to  the 
manifold  Since  this  deformation  is  smooth  (^t  least  C^),  gradF  remains 
transversal  to  the  perturbed  manifold  As  this  perturbed  manifold  is  also 
cod  1,  one  can  use  the  gradF  direction  to  develop  a  “signed  distance”  of  a 
general  point  from  Indeed,  this  is  the  Melnikov  function  of  the  first 

measurement! 

5.3.2  First  (Melnikov)  Measurement 

Melnikov  analysis  was  first  used  for  pde’s  by  Holmes  and  Marsden  [30],  [29] 
to  establish  the  transversal  intersection  of  the  stable  and  unstable  manifolds 
when  one  of  them  is  one  dimensional.  However,  when  the  perturbation  intro¬ 
duces  new  slow  saddle  directions  (i.e.,  when  the  problem  is  singular),  their 
methods  are  not  sufficient  to  establish  the  transversal  intersection  of  these 
manifolds.  For  a  truncation  of  NLS  to  a  singular  problem  in  four  dimensions, 
Kovacic  and  Wiggins  [37],  [40]  were  the  first  to  combine  Fenichel  fibers  with 
Melnikov  analysis.  We  [41],  [44]  then  developed  this  combination  for  pde’s. 

Consider  the  perturbed  NLS  equation  in  the  form 

qt  =  iH'{q)  +  eG{q) 

where  H'{q)  =  —  fe  — 2 [g?— q  and  G{q)  =  —aq  —  ^Bq—l.  In  this  “first 
measurement”  we  [44]  will  construct  a  Melnikov  function  A  whose  zeros  cor¬ 
respond  to  orbits  that  do  not  lie  in  the  invariant  plane  He,  are  asymptotic  to 
the  saddle  point  Q  in  backward  time,  and  asymptotic  to  the  locally  invariant 
manifold  A4e  in  forward  time.  This  orbit  is  of  the  type  depicted  schemati¬ 
cally  in  Figure  5.2.  Notice  that  as  the  orbit  leaves  the  saddle  Q,  the  phase 
point  moves  very  slowly  and  remains  near  the  plane  H^  for  a  long  time.  Then 
it  suddenly  “takes-off”  and  rapidly  flies  away,  only  to  return  near  the  slow 
manifold  Mt)  which  it  slowly  approaches  in  foreward  time.  The  very  slow 
motion,  on  both  the  “take-off”  and  “landing”  sides,  will  be  described  with 
Fenichel  Fibers. 

The  argument  begins  by  considering  the  unstable  manifold  W'^{Q)  of  the 
saddle  Q.  This  manifold  is  two  dimensional,  with  a  slow  direction  in  the 
plane  Uc  C  Me  and  the  other  fast  direction  off  of  the  plane,  tangent  to  cos  a;. 
On  the  plane  He,  this  unstable  manifold  is  a  curve  which  is  controlled  by 
the  energy  (3.10)  of  the  driven  pendulum.  This  curve  lies  near  the  circle 
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Figure  5.2:  Schematic  Diagram  of  the  Homoclinic  Orbit 


of  fixed  points  and  can  be  viewed  as  a  graph  over  this  circle.  That  is,  the 
curve  can  be  parameterized  by  an  angle 

Since  W'^iQ)  is  a  submanifold  of  it  inherits  the  Fenichel  Fibration 

over  the  curve  C^: 

W^{Q)=  U 

m€C“ 

In  particular,  we  will  fix  a  base  point  Of,  on  the  curve  C“,  and  consider  two  dis¬ 
tinct  fibers  each  with  this  same  base  point  -  the  perturbed  fiber 
its  unperturbed  neighbor  (See  5.3  Fixing  a  short  positive  distance 

J  >  0  along  these  fibers  enables  us  to  define  the  unperturbed  and  perturbed 
“take-off”  points,  7^  and  TS  each  of  which  is  a  function  (through  m)  of  the 
base  angle 

r"  =  j^(<5.;0) 

r  =  (5.25) 

In  this  manner,  we  have  used  fibers  to  describe  the  slow  motion  and  have 
made  a  precise  identification  two  take-off  points,  one  on  the  unstable  manifold 
W“((5)  and  the  other  on  the  unperturbed  manifold  IFo  (lie)  =  IFo(nc). 
Next,  we  consider  two  orbits  initialized  at  these  take  off  points: 

q^it)  =  F‘(T«;0) 

q,it)  =  F‘(r;£).  (5.26) 

The  orbit  qh{t)  is  an  unperturbed  integrable  orbit,  which  certainly  asymptotes 
to  the  plane  lie  in  foreward  time.  In  fact,  analytical  expressions  precisely  and 
concretely  determine  its  fate  on  the  plane.  On  the  other  hand,  the  orbit  qe{t) 
is  a  solution  of  the  perturbed  NLS  equation.  Our  goal  is  to  determine  if  a 
take-off  angle  can  be  chosen  to  guarantee  that  this  orbit  will  asymptote  to 
the  slow  manifold  as  i  — >•  +oo. 
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Figure  5.4:  Schematic  Diagram  of  the  First  Measurement 

From  the  explicit  expression  for  the  integrable  homoclinic  orbit  qh  {t) ,  one 
sees  that  there  is  a  time  T^{S)  such  that  dist{qh{t) ,  flc)  <  S  for  all  t  >  T*.  At 
Th.  we  define  two  “landing  points” 

4  =  qhiT.)  =  F^*(r°;0) 

4  =  9.(T.)  =  F^-(r;£). 

By  finite  time  perturbation  theory,  these  two  landing  points  are  0(e)  apart. 
The  unperturbed  landing  point  4  €  VFq  (lie).  We  hope  to  choose  the  take-off 
angle  0^  so  that  the  perturbed  landing  point 

The  manifold  is  codimension  1  and  is  e  close  to  the  integrable 
moreover,  the  latter  is  characterized  by  {q  G  H^\F{q)  +  2  =  0}.  Therefore, 
we  can  define 

as  a  measure  of  the  signed  distance  between  4  and  gs,  where  denotes  the 
intersection  of  the  line  through  4  and  the  manifold  .  (See  Figure  5.4). 
To  actually  calculate  A,  we  define,  for  t  <  0,  the  orbits 

g*(^)  =  qh{t-\-T^) 

Qui^)  =  ge(t  +  T*). 
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Similarly,  for  f  >  0,  we  define  qs{t)  to  be  the  solution  of  the  perturbed  problem 
with  initial  data  and  we  extend  the  definition  of  q^{t)  to  all  t.  These  orbits 
allow  us  to  introduce  the  definitions 

A“(f)  =  {F'{q^{t),  qu{t)  -  ^*(0  )  ^  ^  0 

A+(t)  =  {F\q4t),qs{t)-q,{t))  t>0 

A  =  A"(0)--A+(0) 

Finally,  as  in  the  finite  dimensional  case  [62],  differential  equations  for  A“(f) 
and  A+(i)  enable  us  to  estimate  the  signed  distance  A  in  terms  of  the  Mel¬ 
nikov  integral: 

Proposition  5.1  The  distance  A  is  given  by 

A  =  eMF{0b)+O{e'^), 

where 

Mf=  {F'{q4t)),G{q4t)))dt. 

J  —  OO 

The  detailed  proof  of  this  estimate  may  be  found  in  [44]. 

This  proposition  establishes  the  geometric  significance  of  the  Melnikov 
function  which  was  calculated  in  Section  4.2,  For  the  readers  convenience, 
we  summarize  the  results  of  those  calculations.  Let  quj{t)  denote  an  orbit  of 
the  integrable  pde  which  is  homoclinic  to  the  circle  of  fixed  points  Su>  and  >/e 
close  to  the  integrable  homoclinic  orbit  (f) . 

Corollary  5.2  The  distance  A  has  an  expansion  in  e  given  by 
A  =  eMFia, 

/OO 

{F'(qUt)),G{qAt)))dt 

-OO 

=  -  [aM„  +  pMf}  +  Mo  cos  -  2p„)]  (5.27) 

where 

/OO 

{F'iqUt))> 

-OO 

/OO  ^ 

{F'{q^{t)),  Bqujit))dt 

-OO 

/OO 

{F'(qu>{t)),  1)  dt  . 

-OO 


MELNIKOV  ANALYSIS  FOR  PDE’s 


91 


More  explicitly, 


—  dr  d  sech  r 

7-00  Jo  (tA^ 

[sech  r  +  sin  Po  tanh^  r  cos  a;  -  sin^  Po  sech  r  (2  +  cos^  x) 
+2  sin^  Po  sech  cos  x] 


,  47ra;^sin^Po  sech  r 

" 

j^sinpo  sech  rcosx  -sin^Po  sech  ^r(l  +  sin^  a:)j  x 
[2  sech  T  —  sinpo  sech  ^rcosx  —  2sin^Po  sech  r 
+2sin^Po  sech  cosx]  +  0(sin^“^  po) 


^  ,  47ro;sin^Po  sech  r 

dr  /  dx - - X 

•oo  Jo  crA^ 


Mo  =  [  dr  [  dx 

J—oo  Jo 

[  sech  r  —  sin  po  cos  x] 


where  po  =  tan“^  \/a;^  —  1,  A  =  [\  —  sinpo  sech  rcosa:]  and  where  the 
0(sin^“^  Po)  term  in  the  Mp  equation  is  due  to  the  fact  that  we  used  —d^ 
instead  of  B  in  our  computation. 

Assuming  that  of  Mp  are  nonzero,  then  the  function 


M(a,  13,  6h)  =  aMa  +  fJMp  4-  Mq  cos(^i,  ~  2po) 


(5.28) 


has  nondegenerate  zero.  This  implies  that  we  can  choose  our  parameters  so 


that  A  =  0,  i.e,, 


^  qs  €  VTf  , 


and  the  first  measurement  is  complete. 


5.3.3  Second  Measurement 

As  a  consequence  of  the  first  measurement,  the  “take-off  angle”  can  be 
chosen  so  that  the  landing  point  4  lives  in  ,  and  thus  the  orbit  asymptotes 
to  Mf  However,  we  do  not  know  if  it  asymptotes  to  the  saddle  point  Q.  In 
order  to  ascertain  that  it  does  return  to  Q,  we  perform  a  second  measurement. 

The  necessity  of  a  second  measurement  was  first  understood  in  finite 
dimensional  discretizations  of  this  NLS  system  -  first  for  a  four  dimensional 
system  [38], [53],  and  then  for  the  more  delicate  2^" -h  2  dimensional  system 
[41],  [43], 

This  second  measurement  makes  substantial  use  of  Fenichel  Fibers.  In 
order  to  determine  the  foreward  fate  of  the  orbit,  one  does  not  follow  the 
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orbit  through  the  landing  point  4-  Rather,  one  follows  the  orbit  through  the 
base  point  of  that  stable  fiber  on  which  4  resides. 

This  second  measurement  is  also  a  shooting  argument.  While  the  stable 
manifold  W^{Q)  is  cod  2  as  a  submanifold  of  the  entire  space,  it  is  cod  1  as  a 
submanifold  of  .  Moreover,  as  a  consequence  of  the  first  measurement,  the 
landing  point  4  resides  in  Wf .  Thus,  a  one  dimensional  shooting  argument 
should  suffice  for  the  second  measurement. 

Denote  the  base  points  of  the  landing  fibers  in  the  perturbed  and  unper¬ 
turbed  cases  by  and  bo,  respectively: 

4  e 

4  €  ^4(-;0). 

The  unperturbed  base  point  is  known  to  lie  on  the  plane  bo  G  He.  On  the 
other  hand,  the  perturbed  base  point  is  only  restricted  to  reside  in  the  cod 
2  slow  manifold  b^  E  (In  the  four  dimensional  model  problem  [53],  [38], 
both  base  points  lie  on  the  plane  lie  which  makes  the  second  measurement 
easier.)  However,  because  of  the  smoothness  of  the  fibers  with  respect  to  €, 
we  do  know  that  the  two  base  points  b^  and  bo  are  e  close. 

The  issue  is  whether  the  base  point  b^  €  Ale  actually  lies  on  the  stable 
manifold  W  =  In  [44],  we  establish 

Proposition  5.2  The  distance  from  the  landing  base  point  b^  to  the  stable 
manifold  W  can  be  measured  as  a  function  of  the  take-off  angle  9h  by 

d  =  lo]2  sin  2po  cos (^6  -  2po)  -f  AaLopf\  +  O  {y/e) 

The  proof  of  this  second  measurement  involves 

1.  the  smoothness  of  the  fibers  to  estimate  that  the  perturbed  and  unper¬ 
turbed  base  points  are  close; 

2.  the  energy  (3.10)  of  the  driven  pendulum  to  measure  distances  on  the 
plane  He; 

3.  the  height  of  the  wall  W  as  estimated  by  the  normal  form  calculation 
to  ensure  that  projections  on  the  plane  accurately  capture  the  landing 
base  point  as  it  oscillates  across  the  wall  (as  a  function  of  the  shooting 
parameters” ) ; 

4.  the  explicit  formula  for  the  integrable  phase  shift  which  expresses  the 
landing  angle  as  a  function  of  the  take-off  angle 

Details  of  this  second  measurement  are  described  in  [44]. 
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In  summary,  the  first  and  second  measurement  produce  a  pair  of  equations 
which  must  be  satisfied  for  a  homoclinic  orbit  to  persist: 


cos(^6  -  2po)  =  - 

P=-a  [m„  + 

Sin  2pc 


2au>Po 
sin  2po 


Mo  j  Mp 


(5.29) 

(5.30) 


By  the  implicit  function  theorem,  for  e  small  we  can  solve  d  =  A  =  0  in  a 
small  neighborhood  of  the  point  given  by  equations  (5.29)  and  (5.30).  Finally, 
/3  can  be  shown  to  be  positive  (so  that  the  perturbation  is  indeed  dissipative) 
either  numerically  for  wide  range  of  parameter  values,  or  with  a  completely 
analytic  argument  if  we  assume  that  p^  —  tan“^  \/4cj2  _  \  is  small  enough 
i.e.,  u  close  to  1/v^. 

The  simultaneous  zeros  of  d  and  A  determine  (i)  the  take-  off  angle  Oh 
as  a  function  of  the  external  parameters  and  (ii)  one  relationship  amongst 
the  external  parameters.  Thus,  the  homoclinic  orbit  is  shown  to  persist  (as  a 
singular  deformation  of  the  integrable  orbit)  provided  the  external  parameters 
are  restricted  in  a  cod  1  manner. 


Remark  5.7  Notice  that  both  measurements  are  used  together  to  remove 
the  “cut-off”.  The  first  measurement  establishes  a  heteroclinic  connection 
between  the  saddle  Q  and  the  slow  manifold  a  manifold  which  has  only 
been  shown  to  persist  for  the  cut-olf  flow.  However,  the  second  measurement, 
which  shows  that  the  orbit  returns  to  the  saddle  Q,  can  be  used,  together 
with  control  provided  by  the  unperturbed  homoclinic  orbit,  to  show  that  the 
orbit  homoclinic  to  Q,  once  it  returns  near  never  leaves  the  region  where 
the  cut-off  function  is  the  identity.  In  this  manner,  the  theorem  establishes 
the  existence  of  a  orbit  homoclinic  to  the  saddle  Q  for  the  original  perturbed 
NLS  equation,  without  any  cut-off. 


Remark  5.8  Once  the  persistence  of  one  homoclinic  orbit  is  established,  the 
symmetry  x  x-j-rr  shows  the  persistence  of  a  second  homoclinic  orbit.  One 
member  of  this  pair  represents  a  coherent  spatial  excitation  located  at  a:  =  0, 
and  the  other  represents  an  excitation  at  a:  =  tt.  In  addition,  equations 
(5.29)  and  (5.30)  can  possess  several  distinct  solutions  in  some  parameter 
ranges,  each  with  its  symmetric  partner.  Thus,  the  analysis  shows  that  many 
homoclinic  orbits  can  persist  for  the  perturbed  NLS  equation. 


6  Remarks  on  Symbol  Dynamics 

The  simplest  chaotic  behavior  which  was  observed  in  the  numerical  experi¬ 
ments  for  the  perturbed  NLS  equation  consisted  of  a  single  solitary  spatial 
excitation  which  jumps,  irregularly  in  time,  between  the  two  distinct  spatial 
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locations  at  a;  =  0  and  x  =  n.  These  numerical  experiments,  together  with 
the  persistence  of  a  symmetric  pair  of  homoclinic  orbits,  suggests  a  ‘‘symbol 
dynamics”  explanation  of  this  phenomena. 

More  precisely,  the  term  “  symbol  dynamics”  refers  to  the  existence  of 
an  invariant  set  in  the  phase  space  which  is  topologically  equivalent  to  a  set 
of  all  symbol  valued  sequences.  In  our  setting,  these  sequences  would  take 
the  values  of  C  (center)  or  E  (edge),  and  the  dynamics  is  represented  as  a 
shift  on  this  sequence  space.  As  such,  the  dynamics,  when  restricted  to  the 
invariant  set,  is  as  random  as  a  sequence  of  “coin  tosses”. 

In  finite  (usually  very  small)  dimensional,  the  existence  of  such  an  invari¬ 
ant  set  is  established  by  constructing  a  “Smale  horseshoe”,  [59],  [55],  [23], 
[62].  Such  constructions  are  carried  out  for  orbits  homoclinic  to  the  saddle 
Q  for  the  four  dimensional  truncation  [53]  and  for  the  2N  +  2  dimensional 
truncation  in  [41],  [46].  A  similar  construction  is  in  progress  for  the  pde. 

Symbol  dynamics  is  very  appealing  because  it  demonstrates  the  existence 
of  chaotic  motions  which  last  for  all  time.  However,  it  has  some  drawbacks. 
First,  it  occurs  on  a  very  small  set  in  phase  space,  which  is  not  shown  to  be 
(and  is  likely  not)  a  stable  set.  As  such,  this  type  of  chaos  my  not  be  observ¬ 
able.  Moreover,  the  behavior  depends  on  parameters  in  a  bifurcation  fashion. 
Often  the  parameter  values  required  to  show  the  existence  of  the  horseshoe 
are  very  far  from  the  values  of  the  parameters  at  which  chaotic  behavior  is  ob¬ 
served  in  numerical  experiments.  (For  example,  in  our  analytical  results  [44], 
an  addition  dissipation  /5  >  0  is  required  which  satisfies  a  cod  1  constraint. 
However,  in  the  numerical  experiments  [52],  chaotic  behavior  is  observed  for 
/?  =  0,  over  a  range  of  a  values.)  Finally,  the  construction  of  the  horseshoe 
is  almost  always  done  for  generic  abstract  models,  rather  than  for  a  fixed 
specific  dynamical  system.  To  us,  this  generic  situation  seems  to  be  a  Revere 
limitation  of  the  practice  of  the  method.  (Sometimes  numerical  constructions 
are  used  to  verify  the  generic  assumptions  for  specific  examples,  and  thus  to 
overcome  this  limitation.  However,  in  high  dimensional  singular  situations, 
such  numerical  verifications  are  extremely  difficult  -  if  not  impossible.) 

Recently,  Haller  has  been  developing  an  alternative  perspective  which  he 
has  applied  to  finite  dimensional  discretizations  of  the  perturbed  NLS  equa¬ 
tion  [27],  [25],  and  which  he  is  currently  extending  to  the  pde  [26].  In  his 
work,  using  very  similar  geometric  perturbation  methods  to  those  summa¬ 
rized  here,  he  constructs  a  large  class  of  heteroclinic  orbits  from  the  saddle  Q 
to  (for  example)  the  sink  P.  He  fixes  a  finite,  but  arbitrary,  positive  integer 
N ^  as  well  as  any  sequence  of  C’s  and  E’s  of  length  N,  and  then  establishes 
the  existence  of  a  heteroclinic  orbit  which  recovers  this  pattern.  While  only 
transient  behavior,  the  length  of  the  transients  is  arbitrarily  long.  In  any 
case,  this  set  of  heteroclinic  orbits  certainly  demonstrates  very  complicated 
dynamics  which  depends  sensitively  upon  initial  conditions.  Moreover,  as  the 
second  measurement  is  not  required  to  force  the  orbit  to  return  to  the  saddle 
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(5,  these  heteroclinic  orbits  exist  for  a  full  open  set  of  external  parameter 
values,  without  any  cod  1  restriction. 

7  Other  Equations  and  Related  Work 

The  importance  of  this  work  is  not  in  the  construction  of  one  particular  homo¬ 
clinic  orbit  for  one  specific  equation.  Rather,  it  is  the  development  Melnikov 
methods,  together  with  geometric  singular  perturbation  theory,  for  near  con¬ 
servative  pde’s.  We  believe  that  these  will  become  general  methods  which  will 
be  applicable  to  a  wide  class  of  problems.  For  example,  the  methods  should 
be  useful  for  the  study  (i)  perturbations  of  other  integrable  pde’s  such  as  the 
sine-Gordon  [22],  [8],  and  Davey-Stewartson  equations;  (ii)  discretizations  of 
integrable  equations  [2],  [3],  [51],  [10];  (iii)  systems  without  even  symmetry  [1]; 
(iv)  the  Ginzburg-  Landau  equation  [56],  and  many  others.  Connections  be¬ 
tween  Melnikov  analysis  and  the  methods  of  inertial  manifolds  [13],  [14],  [15], 
should  be  developed.  Melnikov  methods  should  also  be  useful  in  the  study 
of  conservative  perturbations  of  conservative  pde’s,  which  possess  interesting 
transport  and  diflPusion  properties  [21].  Finally,  these  methods  should  also 
play  a  role  in  the  description  of  the  spatial,  as  well  as  temporal,  chaos  [28], 
[11],  [52], 

Clearly  this  work  for  pde’s,  which  combines  Melnikov  methods  with  geo¬ 
metric  singular  perturbation  theory,  is  just  beginning.  The  study  of  pertur¬ 
bations  of  the  NLS  equation  shows  that  the  methods  are  useful  tools  for  pde’s. 
Their  generality,  and  general  usefulness,  is  yet  to  be  determined. 
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Integrable  Hamiltonian  systems 

Percy  Deift,  Courant  Institute 

These  four  lectures  concern  integrable  Hamiltonian  systems.  The  goal  in  the 
lectures  is  to  give  a  rapid  but  elementary  introduction  to  the  subject,  with  a  view 
to  describing  some  recent  results  in  the  field:  the  reader  is  of  course  encouraged 
to  consult  the  References  for  more  details.  The  lectures  are  presented  in  informal 
“blackboard”  style  in  order  to  preserve  the  tutorial  atmosphere  of  the  Summer 
School,  and  are  divided  up  as  follows. 

Lecture  1.  What  is  a  Hamiltonian  system?  What  is  an  integrable  system? 
Lecture  2.  Examples  of  finite-dimensional  integrable  systems. 

Lecture  3.  Examples  of  infinite-dimensional  integrable  systems. 

Lecture  4.  Long-time  behavior  of  integrable  systems. 

Lecture  1.  (References;  [G]  —  a  physics  text,  [A]  —  a  mathematical  physics  text, 
[AM]  —  a  mathematical  view  of  the  foundations  of  mechanics.)  The  space  M  = 
is  an  example  of  a  symplectic  manifold  i.e.,  it  is  an 

•  even  dimensional  manifold,  with  a 

•  non-degenerate  2-form  w,  i.e.,  w{u,  v)  =  0  for  all  v  €  T^M,  implies  u  =  0, 

•  which  is  closed,  i.e.  dw  =  0. 

Exercise:  Show  that  a  symplectic  manifold  must  be  even  dimensional. 

n 

For  the  standard  2-form  is  w  —  dxi  A  dyi.  Clearly  dw  =  0.  If  v  = 

1=1 

^  /  \  t 

^  (6)’  “  (^)’  a  simple  calculation  shows  that  w{v^  v^)  = 

((6)  >  J (y))  >  ’^here  q  )  ’  /  is  the  n  X  n  identity,  and  (•,  ♦)  denotes  the 

standard  inner  product  on  From  this  it  is  clear  that 


w(v,v')  =  0  for  all  v'  =>  =  0  ^  =  0, 


.e.  =  0, 


so  w  is  non-degenerate. 

Functions  (i.e.  ‘Hamiltonians’)  H:  M  — >  M  generate  vector  fields  through  w  in 
the  following  way.  At  any  point  m  e  M ,  dHm  is  a  1-form:  hence  v  ^  dHm[v)  is 
a  linear  map  from  TmM  to  M.  Thus,  as  w  is  non-degenerate,  there  exists  a  unique 
vector  VH{m)  G  TmM  such  that 

(1)  dH{v)  ^  w{vff,v),  for  all  v  e  TmM. 


In  the  case  of  I  u;  =  ^  dxi  A  dyi  ) ,  equation  (1)  becomes 


=  {VB,J 
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Hence  =  J'^vh,  which  implies  vh  =  (j^J-  Thus  H  gives  rise  to  the  standard 
Hamiltonian  vector  field 


dt  \y)  V“^^/ 

The  fact  that  w  is  non-degenerate  plays  an  obvious  role,  as  above.  The  role  that 
the  closedness  of  w  plays,  is  more  subtle.  If  H,  K  are  two  Hamiltonians  on  M,  we 
define  their  Poisson  bracket  {H,  K}  through 

{H,K}  =  w(vh,'Vk)- 

Note  that  {H,K}  =  Clearly  {H,K}  =  dH{vK)  =  vk{H)  =  where 

^  is  the  derivative  of  H  in  the  direction  of  vk-  Thus 

at 

and  in  particular  ^  =  {/C,  K}  =  0,  as  {•,  •}  in  skew  i.e.  K  is  a  conserved  quantity 
for  the  flow  that  it  generates. 

Exercise:  Verify  Leibniz’s  rule,  i.e.  {HK,  L}  =  H{K,  L}  -f  {H,  L}K  for  all  Hamil¬ 
tonians  H,K,L. 


Observe  that  for  ^  ^2/*^  ? 


\Ky 


dxi  dyi  dyi  dxi )  ’ 


which  is  the  standard  Poisson  bracket  on 

Now  it  is  an  (Exercise)  in  the  exterior  calculus  to  show  that  up  to  a  constant 
C,  dty  is  a  3-form  satisfying 

dw{vH,VKi  vl)  = 

for  all  Hamiltonians  H,K^L.  Thus 

w  is  closed  ^  satisfies  the  Jacobi  identity,  i.e. 

{{H,  K},  L}  -f  {{K,  I},  H}  P  {{L,  H},K}  =  0. 

This  leads  to  the  following  critical  calculation  for  the  commutator  [vh,vk]  of 
Hamiltonian  vector  fields: 


[vH,VK]iL)  =  vh{vk{L))  -  vk{vh{L)) 

=  vh{{L,K})-vk{{L,H}) 

=  {{LJ<},H}  +  {{H,L}J<} 

=  —  {{/C,  if},  I/},  by  the  Jacobi  identity, 

=  V{k,h}{L) 
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i.e. 

(2)  bn.'^K]  = 

Thus  the  commutator  of  two  Hamiltonian  vector  fields  is  again  a  Hamiltonian  vector 
field.  Moreover  the  map,  v/j,  is  an  anti-isomorphism  from  the  Poisson  algebra 
of  functions  on  M  with  product  given  by  the  Poisson  bracket,  into  the  algebra  of 
vector  fields  with  product  given  by  the  commutator  of  vector  fields.  Most  impor¬ 
tantly,  we  notice  that  two  Hamiltonian  vector  fields  commute  ^  their  Hamiltonians 
Poisson  commute.  This  is  perhaps  the  most  useful  consequence  of  the  fact  that  w 
is  closed. 


Exercise:  Two  vector  fields  v,  v'  commute  the  flows  that  they  generate 

commute  i.e.,  if  ^  =  v{(g),  ^  then  o  o  . 

To  summarize,  we  see  that  symplectic  manifolds  (M,  w)  are  equivalent  to  non- 
degenerate  Poisson  manifolds  (M,  {•,■})  where  {♦,  •} 

♦  is  bilinear  from  x  C^{M) 


•  satisfies  Leibniz’s  rule  {HK,  L}  =  H{K,  L}  -f  {H,  L}K, 

•  is  non-degenerate,  i.e.  {H,K}  =  0  for  all  functions  K  implies  H  =  const., 

•  satisfies  the  Jacobi  identity. 


Exercise:  Prove  the  equivalence. 

Question:  How  do  symplectic  manifolds  arise?  There  are  three  main  sources  for 
symplectic  manifolds; 

(i)  T*X,  i.e.  cotangent  bundles  of  manifolds 

(ii)  co-adjoint  orbits  of  groups  on  their  dual  Lie  algebras  (Kostant-Kirillov  2- 
forms) 

(iii)  constrained  systems. 


(i)  Let  X  be  a  manifold,  let  x  G  X,  and  let  G  T*X.  Let  i;  be  a  vector  field 
on  T*X.  Let  tt  denote  the  natural  projection  on  T*X  to  the  base  point  x:  thus 
7r(a^)  =  X  and  7r*t;  G  T^X.  Hence,  as  G  T*X, 

0{a^){v)  = 

defines  a  natural  1-form  0  on  T*X.  Then  w  =  dB  defines  a  2-form  on  T*X,  which 
is  clearly  closed,  and  which  can  easily  (Exercise)  be  shown  to  be  non-degenerate. 

Exercise:  On  show  that  6^  =  —  ^  yidxi  and  hence 

i  —  \ 

n 

w  —  dO  ~  ^  dxi  A  dyi . 

izzl 

The  above  construction  is  very  basic  in  mathematics  and  leads  to  the  appearance 
of  symplectic  manifolds  in  many  different  mathematical  situations. 

(ii)  Let  ^  be  a  Lie  algebra  with  Lie  bracket  [•,•].  Thus  [•,  •]  is 
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♦  bilinear,  and 

#  satisfies  the  Jacobi  identity,  i.e.  +  [[2/? d-  [[^?3j],2/]  —  0,  for  all 

x,y,z  e  Q- 

Let  G  be  the  associated  (connected)  group  and  let  g*  be  the  dual  Lie  algebra. 
Then  G  acts  on  G  by  the  Ad-action, 

r  Ad:  g^g 

[  =  geG.xeg. 

Also  G  acts  on  G*  by  adjointness 

(Ad*:g^-^g* 

\  {Ad*ga,x)  =  {a,Adgx)  =  a{Adgx), 

where  (•,  •)  denotes  the  pairing  of  G*  and  G. 

Exercise:  Show  that  Ad*^g^  =  ^^91^^92 ' 

The  co-adjoint  orbit  Oa  through  a  point  of  G  6*  is  given  by 

Oa  =  {Adla:  g  G  G}. 

The  remarkable  fact  is  that  carries  a  non- degenerate  2-form,  and  hence  is  natu¬ 
rally  a  symplectic  manifold  (and  hence  is  always  an  even  dimensional  manifold).  To 
see  what  this  2-form  is  one  can  proceed  functorially.  Vector  fields  on  Oa  through 
the  point  /?  =  Ad^a  G  are  given  by  Ad^.^ld  for  arbitrary  xeG.  Indeed, 


_d 

dt  t=o 


Adl,.^  =  4  Adl,.Ad*a 
®  dt  t-o  ^ 


Ad*g^t.a  G  TaO, 


Question:  How  should  we  define 


dt  u=o 


Adli.p, 


dt  lt=o 


The  only  natural  way  is 

=  /?([*.!/])  =  (A  [*-!/])> 

and  indeed  with  this  definition  (Exercise),  C)„  is  indeed  a  symplectic  manifold. 
Example:  Consider 

G  =  G/+(n,R)  =  G/(n,R)n{M:  detM>0}, 

Q  =  (//(n.lR)  =  Af(n,]R). 

We  can  identify  G*  as  M(ri,R)  through  the  non-degenerate  pairing 

{A,B)=tT:AB 

i.e.  the  matrix  A  induces  a  linear  map  on  5  to  R  through 

B  AB, 
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and  moreover  every  linear  map  i[B)  on  5  to  M  is  of  this  form  for  some  (unique) 

Now  for  ^  E  E  Q,  AdgX  —  ge^^g  =  gxg~^,  where  the  RHS  can  now 

be  taken  to  be  ordinary  matrix  multiplication.  Also 

(Ad*A,x}  =  (A,Adgx)  =  {A,gxg~^)  =  tT  Agxg~^  =tT  g~'^Agx 
=  {g~'^Ag,x). 

Thus  Ad*gA  =  g~^Ag  and  hence 
Oa  -  {g^'^Ag:  g  &  G} 

=  {set  of  all  (real)  matrices  that  are  G/'^(n5  IR)-conjugate  to  A}. 

Also  dimOyi  can  be  any  even  number  <  —  n. 

Exercise:  Compute  all  possible  co-adjoint  orbits  Oa  in  the  case  of  2  x  2  matrices. 

Now  observe  that  each  x  E  Q  induces  a  function  Hx  on  Q*  in  a  natural  way,  as 
follows, 

Hx{A)  =  (A,  x)  =  tr  Ax. 

For  such  functions  Hx  we  have 


dHxl^A  Ad:.,  A]  =  ~ 

\dt  h=o  J  dt  t=o 


dt  n=:0 


HxiAdl.yA)  =  -  ^  Hx{e-^^Ae^y) 


tr(e  ^^Ae^^x)  —  —  tT{Ae^^xe  *^) 


=  tr(yl[2/,a:]) 

—  '^A  (  ~T2  AdgiyA. 

\  dt  t=o 


dt 


Ad:..  A  . 


yHAA)  =  -- 

But  then  this  means  that 


dt  lt=o 


Ad:..A=:--  = 

dt  t=o 


{Hx,  Hy}  =  WA{VH^,VHy)  =  tr  A[x,  y]. 

The  differential  dHx{A)  is  by  definition  the  (unique)  functional  on  Q* ,  and  hence 
the  unique  element  in  (Q*)*  —  Q,  such  that 


dHx{A){B)  =  -  ^  Hx{A-^tB) 


tr  x{A  -f  tB)  =  tr  xB, 


which  implies  dHx{A)  —  x,  and  we  see  that 


{Hx,Hy}(A)  =  tTA[dHx{A),dHy{A)] 


By  linearity  and  Leibniz’s  rule  we  conclude  that 


{HJ<}{A)  =  tr  A[dH(A),dK(A)] 
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for  arbitrary  smooth  functions  on 

Exercise:  Show  that  arbitrary  smooth  functions  on  G*  can  be  approximated  by 
finite  linear  combinations  of  powers  of  functions  of  type  . 

On  more  general  dual  Lie  algebras  the  above  formula  becomes 


In  the  case  of  G/+(n,  E),  for  H:  M(n,E)  M,  we  have 


£ 

dt 


t=o 


where  VH{A)  is  the  matrix  with  entries  •  Thus 

{H:K}{A)  =  tv  {A[VH'^(A),VK'^{A)]), 

Notice  that  what  we  have  really  constructed  is  a  Poisson  manifold  i.e.  G*  is  a 
manifold  with  a  bracket  {•,  •}  which  satisfies  all  the  conditions  for  a  Poisson  bracket, 
except  that  it  is  in  general  degenerate.  However  G*  is  foliated  by  symplectic  leaves 
i.e.  submanifolds  of  5*  which  are  symplectic  i.e.  on  which  the  2-form  becomes  non¬ 
degenerate.  These  symplectic  leaves  are  precisely  the  co- adjoint  orbits.  We  will  say 
much  more  about  this  construction  in  the  next  lecture. 

Exercise:  In  the  case  that  the  base  manifold  X  is  a  group,  say  G,  the  manifolds  in 
(i)  and  (ii)  are  related.  Basically  the  symplectic  structure  in  (ii)  is  a  pull-back  of 
the  structure  in  (i)  to  the  identity  in  G.  Make  this  explicit. 

(iii)  Constrained  systems 

Suppose  we  have  n  one-dimensional  harmonic  oscillators,  Xi  -\~  XiXi  —  0,  1  ^ 

i  <  n.  This  flow  is  generated  by  the  Hamiltonian  H  =  ^  E  2/?  + 

~  1=1 

E  ^  .  Suppose  we  now  constrain  the  oscillators  to  lie  on  the  sphere 


i  =  l 

How  would  we  describe  the  motion?  Well,  we  would  recall  how  we  solved  the 
problem  in  our  first  physics  course  of  a  particle  moving  along  a  wire,  and  proceed 
accordingly.  The  Hamiltonian  version  of  this  procedure  is  the  following. 

Let.  42  =  t  (=  Let  X  =  {(a:,y)  £  4i  =  <l>2  =  0}.  Clearly 

the  constrained  motion  should  lie  on  X  C  Moreover  X  is  even  dimensional  and 

it  carries  a  natural  2-form  Pw,  the  pull-back  of  w  under  the  immersion  i:  X 
Alternatively,  w\X  is  just  the  restriction  of  w  to  TX  C  As  the  operator  d 

commutes  with  the  restriction  (or  the  pull-back)  operation  it  is  clear  that  d[w\X) 
is  closed.  The  only  question  is  whether  it  is  non-degenerate. 
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Exercise:  Show  that  0,  and  hence  w\X  is  non- degenerate. 


n 

The  constrained  flow  is  generated  by  the  Hamiltonian  H  = 

i=l 


restricted  to  X  with  2-form  w\X. 


Exercise:  Compute  the  equations  of  motion  for  the  above  constrained  flow.  This 
constrained  system  is  called  the  Neumann  system.  As  a  reference  for  constrained 
motion,  see  for  example  [DLTr].  We  will  say  more  about  such  systems  in  the  next 
lecture. 


Question:  How  can  we  integrate  a  dynamical  system 
(3)  X  =  V{x) 


in  say?  Suppose  we  are  really  lucky  and  have  m  —  1  (independent)  conserved 
quantities  (^1, . . . ,  so  that  =  0,  j  =  1, . . . ,  m  —  1,  for  solutions  x{t) 

of  (3).  Then  we  could  solve  for  m  —  1  of  the  variables  in  favor  of  the  first  one, 
say,  and  then  we  would  be  left  with  the  equation  ^  ^2(^1) y  •  •  •  j 

which  can  then  be  integrated  by  quadrature. 

Now  in  the  theory  of  Hamiltonian,  as  opposed  to  general,  systems  a  remarkable 
reduction  occurs.  One  can  “solve”  systems  of  dimension  m  =  2n  which  have  only  n 
(independent)  integrals  0i, . . . ,  provided  that  these  integrals  have  the  additional 
property, 

=  0,  l<i,j<n, 

i.e.  the  integrals  Poisson  commute.  As  the  Hamiltonian  H  for  the  system  is  con¬ 
served,  we  may  always  take  one  of  the  integrals,  say  (^1,  equal  to  H. 

The  main  theorem  in  the  subject  is  the  so-called  Liouville-Arnold-Jost  Theorem, 
which  says  the  following: 

A  Hamiltonian  vector  field  Vh  is  called  integrable  (or  completely  integrable)  on  a 
domain  D  C  if  it  possesses  n  integrals  <^i(=  i/),  <^2?  •  •  • ,  which  are  linearly 
independent  on  D  (i.e.  d(j)i, . . . ,  are  linearly  independent  at  all  points  of  D)  and 
which  Poisson  commute. 

We  require  that  D  is  invariant  under  the  flow  generated  by  H  for  all  t.  Any 
Hamiltonian  system  is  locally  integrable  i.e.  given  any  H  and  a  point  m  G 
there  exists  a  neighborhood  B  of  m  such  that  H  has  n  Poisson  commuting  integrals 
in  this  neighborhood  (Exercise:  prove  this).  But  in  general  the  flow  generated  by 
H  escapes  from  H,  and  so  these  local  integrals  teach  us  nothing  about  the  global 
behavior  of  the  flow.  The  invariance  of  D  is  essential. 

Theorem  1  (Liouville-Arnold-Jost)  Suppose  that  H  =  (j>i  is  integrable  on  a  do¬ 
main  D  C  M  with  integrals  (j)  =  {(^1, . . .,  and  suppose  that  Nq  =  c  M 

is  compact  and  connected.  Then 

(a)  Nq  is  an  imbedded  n-dimensional  torus  T” 

(b)  there  exists  an  open  neighborhood  U{No)  C  M  which  can  be  coordinatized 

as  follows:  if  x  =  {xi, . . . ,  a^n}  are  variables  on  the  torus  and 

y  =  (yi, . .  .,2/n)  G  Di,  where  Dy  C  M”'  is  a  domain  containing  the  origin  0, 
there  exists  a  diffeomorphism 


iP:  rxDi  -^U{No). 
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Moreover  ij)  is  symplectic  (  ie.  A  dyi  j  and 


H  o 'll;  = 


for  some  function  h. 


In  particular,  near  a  compact,  connected  level  set  <^”^(0),  the  flow  is  extremely 
simple.  Indeed,  as  V*  is  symplectic,  the  flow  generated  by  H  is  again  Hamiltonian 
in  the  variables  (jc,  y)  of  x  D\  (Exercise:  show  this),  and  we  find 


h  =  Q, 


SO  that 

Xj{t)  =  Xj(0)+<^(yi,...,yn),  =%(0). 

Thus  the  system  can  be  integrated  explicitly  and  the  solution  is  given  by  straight 
line  motion  on  a  torus.  The  variables  {yj}  are  called  the  actions  for  the  system  and 
the  {iCj}  are  called  the  angles. 

The  Theorem  not  only  tells  us  how  to  integrate  the  system  (in  terms  of  the 
variables  {x,y),  which  may  or  may  not  be  hard  to  construct),  but  perhaps  more 
importantly,  we  can  understand  the  qualitative  behavior  of  the  system. 

The  quantities  w  =  (wi, . . .  ,Wn)  =  (^, . . . ,  are  called  the  frequencies  of 

the  system.  The  neighborhood  U(No)  in  the  Theorem  is  foliated  by  tori  which  we 
parameterized  by  the  values  of  y  £  Di.  As  w  =  w(y)j  and  as  y  is  conserved  by  the 
flow,  the  frequencies  are  constant  on  the  torus.  However,  in  general,  they  vary  from 
torus  to  torus.  We  can  distinguish  tori  according  to  whether  the  Wi^s  are  rationally 
independent  or  rationally  dependent . 

n 

Rationally  independent:  Here  ^  ^  ii  =  0.  In  this  case, 

i=l 

by  a  well-known  theorem  of  Kronecker,  {xq  -\-tw:  tEM}is  dense  in  T^.  Thus  the 
orbit  of  the  flow  is  dense  on  such  tori  and  in  fact  the  flow  is  quasi-periodic  in  time 
with  n  frequencies. 

n 

Rationally  dependent:  Here  there  exist  ji  G  not  all  zero,  such  that 

i-i 

Then  the  flow  is  restricted  to  a  sub-torus  of  T^.  For  example  if  n  =  2  and  wi-2w2  = 
0,  then  the  flow  is  restricted  to  {[x\^X2)  ^  X\  —  2iC2  —  const(modZ)}.  Again 
the  flow  is  almost  periodic,  but  with  fewer  frequencies. 

Thus  the  essential  problem  of  describing  the  long-time  behavior  of  integrable 
Hamiltonian  systems  is  solved,  in  principle,  provided  that  the  invariant  set 
is  compact  and  connected.  If  <^"^0)  is  compact  but  not  connected,  we  can  just 
restrict  our  attention  to  each  connected  component.  Also  if  (f;  ^(0)  is  not  compact, 
then  the  Theorem  goes  through  provided  it  is  known  a  priori  that  each  <j)i  generates 
a  global  flow,  at  least  for  data  near  (^”^(0).  In  this  case  one  learns  that  <j>  ^(0) 
has  a  neighborhood  ?7(iVo)  which  is  a  thickening  by  an  n-dimensional  disk  Di  of  a 
product  of  lines  and  circles, 


C/(Ao)  =  T^xE^-^xT>i. 
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On  each  leaf  x  x  {y}  the  flow  is  again  given  by  straight  line  motion,  but 

now  the  winding  takes  place  on  a  cylinder  rather  than  a  torus. 

The  Liouville-Arnold-Jost  Theorem  describes  qualitatively  the  behavior  of  an 
integrable  system.  In  each  case  the  serious  and  essential  task  remains  of  determining 
the  action-angle  variables  explicitly. 

Sketch  of  the  proof  of  the  Theorem: 

As  =  0,  =  0  which  shows  that  the  Hamiltonian 

vector  fields  induced  by  the  <^i’s  commute.  This  means  that  we  can  immerse 
into  as  follows.  Let  m),  -0^(0,  m)  =  m,  be  the  flow  induced 

on  by  Then,  fixing  mo  E  the  map 

takes  K”  into  the  level  set  No  =  {m:  =  (j>i[mo),  i  =  This  is 

because  (H)  =  {'^■i (m))  =  0,  so  that  <j>i  is  a  constant  of  the 

motion  for  the  (^j-flow,  1  <  i,j  <  n.  Simple  arguments  show  that  F  is  onto  No-  Let 
A  =  E  r(^)  =  mo}.  Now  as  the  flows  commute  it  follows  that  A  is  a  lattice 
in  M"'.  On  the  other  hand  M'^/A  is  mapped  diffeomorphically  onto  Nq.  But  No  is 
compact  by  assumption.  It  follows  that  A  must  have  n  generators  and  hence  E”'\A 
is  a  torus,  and  hence  Nq  is  a  torus.  Then  one  has  to  “thicken”  things* .... 

Elementary  examples: 

(i)  Harmonic  oscillator 

M  =  -{-w^q"^)  =  (j)i. 

The  system  is  integrable  as  0i  is  conserved  and  n  =  1  (clearly  all  Hamiltonian 
systems  on  2-dimensional  manifolds  are  integrable!).  No  =  {{q^p)-  H  =  <f)i  =  C  > 
0}  =  {(g,p):  —  2C'},  which  is  clearly  a  torus.  The  equations  of  motion 

are 

Hp  =p,  p=-Hq-  -vPq, 

with  solution 


a/2C 


W 


sm{wt  a] 


p  =  cos{wt  a). 


Note  p^  -f  w^q"^  =  2C. 

The  map  0  in  the  Theorem  is  constructed  as  follows.  We  can  take  Di  =  M'*'. 
Then 


f  I  ^  I'^y 

M+xT^  9  {y,x)^i;{y,x)  =  {q{y,x),p{y,x))  =  (  W  — sin27rir,  cos  27ra: 


We  have 


w 


H  oTp{y,x) 
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Exercises: 

(a)  The  motion  of  the  pendulum  depends  on  the  value  of  H  =  C  >  0.  Show  that 
there  are  three  different  cases,  C  <2,(7  =  2,C>2.  If(7<2,  the  pendulum 
oscillates  back  and  forth  with  \27rq{t)\  <  tt.  If  (7  >  2,  the  pendulum  rotates 
“over  the  top”.  If  C  ==  2,  the  pendulum  moves  from  27rq  =  -tt  to  27rg  =  +7r 
as  i  runs  from  — oo  to  +oo;  this  case  is  the  so-called  separatrix  for  the  system. 

(b)  Describe  A^o  -  {{q,p)‘  H{q,p)  =  C}  in  the  above  three  cases  and  draw  a 
picture  of  foliated  by  the  invariance  sets  Nq  =  No{C)  for  all  values  of 
C>  0. 

(c)  Construct  the  map  ip  of  the  Theorem. 
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Lecture  2 

The  Liouville- Arnold- Jost  Theorem  describes  the  flow  of  an  integrable  system, 
once  we  have  determined  that  the  system  is  indeed  integrable.  The  big  question 
is,  how  do  we  determine  whether  a  given  system  is  integrable?  The  true  analog  of 
this  question  already  arose  in  freshman  calculus.  You  are  given  a  list  of  twenty 
integrals  to  do  and  you  struggle  with  one  of  them,  say,  over^the  entire  weekend. 
Then  someone  tells  you  to  replace  the  variable  x  hy  y  log(l  +  ar^),  say, 

and  then  the  whole  problem  pops  out!  The  truth  is  that  there  is  no  systematic 
way  to  determine  whether  a  given  system  is  integrable.  You  have  to  guess  and  be 
lucky.  You  can  analyze  large  classes  of  systems  which  have  similar  features  and  free 
parameters  and  which  are  known  to  be  integrable,  and  see  whether  a  given  system 
fits  into  the  class.  You  can  also  test  a  system  to  see  if  it  behaves  like  an  integrable 
system.  Does  it  have  “soliton-like”  solutions,  for  example.  But  to  really  discover 
the  structure  that  makes  it  integrable,  you  have  to  guess  —  you  just  have  to  guess 
that  “y  =  log(l  -{-  will  do  the  job  for  you! 

Now  in  the  nineteenth  century  many  systems  were  shown  to  be  integrable.  For 
example,  in  a  famous  paper  Jacobi  showed  that  geodesic  flow  on  an  ellipsoid  (he 
considered  only  the  3-dimensional  case)  was  integrable.  This  example  generated 
great  interest  at  the  time  because  he  showed  that  the  functions  he  needed  to  solve 
the  flow  were  hyperelliptic  functions  and  that  the  flow  was  linearized  by  mapping 
certain  divisors  on  a  Riemann  surface  onto  the  associated  Jacobi  variety  via  the 
Abel  map  (for  a  recent  analysis  of  the  flow  see  [M2]).  Of  course  the  pride  of  19^^ 
century  mathematics  was  the  Theory  of  Riemann  surfaces,  and  the  fact  that  this 
theory,  recently  developed,  was  exactly  what  was  needed  to  explicitly  solve  some 
dynamical  system,  created  great  excitement  in  the  mathematical  world.  Other  ex¬ 
amples,  amongst  many  others,  were  the  Neumann  system  of  n  harmonic  oscillators 

Xi  inf  Xi  —  0,  1  <  i  <  n,  constrained  to  lie  on  a  sphere  xf  =  \  (this  system 

was  shown  to  be  integrable  in  the  case  n  =  3  by  C.  Neumann  —  for  a  recent  anal¬ 
ysis  of  the  general  case,  see,  for  example,  [M2]),  a  variety  of  spinning  tops,  and 
of  course  Sonia  Kovalevskaya’s  extraordinary  discovery  of  her  integrable  top  with 
its  deep  connections  to  complex  function  theory  and  the  origins  of  Painleve  theory. 
Geometric  symmetries  of  a  system  give  rise  to  integrals  of  the  motion,  a  fact  most 
clearly  understood  by  Emmy  Noether  —  for  example,  translation  invariance  gives 
rise  to  conservation  of  momentum,  etc.  But  this  rarely  gives  us  enough  integrals: 
in  general  we  must  find  other  integrals  which  people  like  to  think  of  as  arising  from 
“hidden  symmetries” .  The  whole  exuberant  development  of  the  theory  of  integrable 
systems  came  to  an  abrupt  halt  at  the  end  of  the  last  century,  however,  as  a  result 
of  a  theorem  of  Poincare,  who  showed  for  the  3-body  problem  that,  apart  from 
the  geometric  integrals,  there  are  no  other  conserved  quantities.  At  this  point  the 
explicit  solution  of  dynamical  systems  declined  as  an  area  of  mathematical  activity, 
and  the  so-called  qualitative  theory  of  dynamical  systems,  pioneered  by  Poincare 
himself,  came  to  the  fore. 

This  was  the  situation  for  almost  80  years,  until  the  remarkable  discovery  of 
Kruskal  and  his  collaborators  (see  [GGKM]).  In  1967  they  showed  that  the  Korteweg 
de  Vries  (KdV)  equation 

(4)  qt  -  ^qqx  +  qxxx  =  0 
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of  shallow  water  wave  theory  was  “integrable”  in  a  precise  sense  and  they  showed 
how  to  linearize  the  flow  explicitly.  The  extraordinary  message  of  their  work  was 
that  the  integrals  for  the  flow  were  contained  in  spectral  and  scattering  theoretic 
quantities  of  an  associated  operator 

H{t)  =  -^  +  q{x,t). 

The  fundamental  significance  of  this  work  was  quickly  grasped  by  P.  Lax  [L], 
who  showed  that  the  KdV  equation  was  equivalent  to  an  isospectral  deformation  of 
H(t).  Indeed  he  showed  that 

(4)  =  ^  =  [B{t),Hit)]  =  Bit)H{t)-H{t)Bit), 

where  B{t)  =  -4^  +  3  (<?(*,«) ^  + 

To  see  that  the  RHS  is  indeed  an  isospectral  deformation  of  H(t)  let  Q(i)  be  the 
solution  of 

^  =  -QB,  Q{t  =  0)  -  /. 

Then  =  BQ* ,  as  R  =  —5*,  and  hence 

L(^QQ*)  =  -QBQ* +  QBQ* 
at 

so  that  QQ*  =  I.  A  similar  argument  shows  that  Q*Q  =  I,  and  hence  Q  is  unitary. 
Set  L{t)  =  Q*(t)H(0)Q{t).  Then 

4l  =  BQ*H{0)Q  +  Q*H{0){-QB)  =  [B,  L], 
at 

As  L(0)  —  -^(0),  we  conclude  that  H{t)  =  L(t)  =  Q* {t)H{0)Q{t),  i.e.,  H{t)  is 
unitarily  equivalent  to  H{t  =  Q).  In  particular,  if  X[H{t))  is  an  eigenvalue  of  H{t). 
Then  X{H{t))  ~  const.,  etc.  Thus  the  spectrum  of  H(i)  gives  integrals  for  the  KdV 
equation.  These  are  the  integrals  that  arise  from  “hidden  symmetries”. 

If  the  matter  would  have  stayed  there,  we  would  just  have  been  looking  at  some 
glorious,  but  isolated,  example.  But  now  we  know,  some  27  years  later,  and  due 
to  the  efforts  of  many  people,  that  there  are  literally  hundreds  of  systems  of  prime 
physical  interest  that  are  integrable  via  a  spectral  transform.  More  precisely,  at¬ 
tached  to  the  dynamical  system  of  interest,  there  is  an  associated  linear  operator, 
and  the  motion  of  the  system  is  equivalent  to  an  isospectral  deformation  of  the 
operator,  and  hence  the  eigenvalues,  etc.,  of  the  operator  provide  integrals  for  the 
flow.  And  indeed  many  people  in  the  “integrable  business”  believe  that  a  system 
is  integrable,  if  and  only  if  it  is  equivalent  to  the  isospectral  deformation  of  some 
associated  operator. 

In  the  rest  of  this  lecture  I  am  going  to  tell  the  story  of  just  one  such  integrable 
system.  I  hope  you  won’t  mind  my  focusing  on  this  system,  but  its  story  contains  the 
elements  of  all  other  integrable  systems.  I  am  going  to  consider  the  so-called  Toda 
Lattice,  consisting  of  n  particles  on  the  line  interacting  with  exponential  forces.  Thus 
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we  consider  the  flow  generated  on  w  =  dxi  A  dyi  j  by  the  Hamiltonian 

n  n  —  1 

Hx  =  ^  A  general  reference  for  the  approach  that  we  will 

i-l  izzl 

follow  is  [DLTo]  and  the  references  therein. 

The  so-called  Toda  lattice  equations  generated  by  Ht  have  the  form 


(5)  if  =  l<i<n, 

where  ~  g®n-a?n+i  =  q  system  has  the  standard  geometric  integrals 

n 

Ht  =  energy  and  P  =  yi  ~  momentum,  but  it  was  clear  from  the  numerical 
1=1 

calculations  of  J.  Ford  in  the  mid  70 ’s  that  something  very  special  and  ‘‘KdV-like” 
was  going  on  with  these  equations.  And  indeed  in  1975  Flaschka,  and  independently 
Manakov,  showed  that  the  Toda  lattice  was  equivalent  to  an  isospectral  deformation, 
as  follows. 

Introduce  Flaschka’s  variables. 


ctk  =  ~yk/^,  ^=l,...,n, 


2 


k  —  1,  .  .  .  ,  71  —  1, 


and  consider  the  Jacobi  matrix 


ai  bi 
bi  02  ^2 


and  the  associated  matrix 


B  =  B{L)  = 


bn-l 

bn—1 


bi 

0  62 


O  -^n-l  0 


=  -B^. 


bn—l 


Note  that  B  =  —  L'^  where  denotes  the  upper  part  of  L.  A  straightforward 


calculation  shows  that 


(5)  =  = 


Thus  the  eigenvalues  Aj , . . . ,  of  the  associated  matrix  L  give  n  integrals  for  the 
Toda  flow.  Moreover  (Exercise)  one  can  show  that 

{AijAj}  —  0  1  <  <  71 


in  w  =  ^  <^yi  j  •  Hence  the  system  is  completely  integrable. 
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According  to  the  Lion ville- Arnold- Jost  Theorem,  in  appropriate  variables,  the 
flow  becomes  a  straight  line  motion  on  some  cylinder  x  T”  What  are  these 

variables?  Associated  to  each  A^,  there  is  a  normalized  eigenvector  Vi^  Y2  U)  ~~ 

j=i 

All  eigenvalues  are  simple  (Exercise)  and  without  loss  of  generality  (Exercise) 
we  can  and  do  specify  Vi  uniquely  by  choosing  Ui(l)  >  0.  Label  the  eigenvalues 
such  that  Ai  <  •  •  •  <  An.  Differentiating  the  eigenvalue  relation  {L  —  Xi)vi  =  0  with 
respect  to  we  obtain 

(L  —  Ai)uj  -f  (T  —  Ai)ut-  =  0, 

or,  as  Xi  =  0, 

(jBL  —  LB)vi  T  (L  —  Aj5)ui  =  0, 

which  implies  that 

(L  -  Xi){vi  —  Bvi)  =  0. 

Hence  Vi  —  Bvi  ~  fivi  for  some  scalar  /i  =  But  (u^,  u^)  =  1,  and  so  {viVi)  =  0. 
Thus  II  =  =  ii[{ivi,Vi)  -  {vi,Bvi)]  =  0,  as  B  =  -B^.  This  proves  that 

Vi  =  Bvi.  In  particular 


But  {L  -  Ai)ui  =  0  implies,  in  particular,  that  (ai  -  Ai)ui(l)  -f  6it>*(2)  =  0,  and  we 
obtain  the  relation 

t)i(l)  =  (Ai  -  ai(<))i;i(l). 

In  turn,  for  i  i,  ^  log(ui(l)/u^- (1))  =  A,-  -  A^,  which  leads  to  the  formula 


On  the  other  hand,  4  E  ^  E  ^  shows  that 


^i(l) 

Xcm  =  centre  of  mass  =  -  2^  ^cig 


2  <  i  <  n 


are  the  variables  which  move  linearly  under  (5). 

From  the  point  of  view  of  the  Liouville-Arnold-Jost  Theorem,  where  is  the  cylin¬ 
der  X  We  have  the  map  y, 

J  =  n  X  n  Jacobi  matrices  9Li — )■  x  (L)  =  (Ai , . . . ,  An,  ui(l), . . . ,  Un(l)) 

e  Ts  |(a,^)  eK^":  a,  <  ■•■<  an,A  >  ^ 
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One  can  show  (Exercise)  that  x  is  a  bijection  of  J  onto  Y .  Consider  the  invariant 
set 

{a}  X  K  X  |/?  e  R":  A  >  ■ 

Observe  that  log  =  2, . . . ,  n,  are  clearly  coordinates  for 


i=i 


which  shows  that  this  set  is  —  h  Thus  there  is  no  torus  and  the  invariance  set 
is  M-  X  T---  M-.  The  variables  {A,},  1  <  z'  <  n,  log  2  <  i  <  n,  are, 

essentially,  the  action-angle  variables  for  the  Toda  flow. 

The  above  solution  of  the  Toda  lattice  is  due  to  Moser  [Ml].  Furthermore,  Moser 
showed  that  as  ^  oo, 


bi(i)  =  0 

Xi  =  yi  zz  —2ai{t)  const. 

Physically  this  means  that  as  f  oo,  particle  xi  moves  out  to  the  left  at  a  steady 
speed,  followed  by  a; 2,  followed  by  X3,  etc. 

In  another  important  development  in  1980,  Bill  Symes  made  a  wonderful  obser¬ 
vation.  Let  Lq  zz  L{i  zz  0)  and  let  —  Q[t)R{t)  be  the  (unique)  Qi?  factorization 
of  i.e.  Q  is  orthogonal  and  R  is  upper  triangular  with  Ra  >  0.  This  is  just  the 
Gram-Schmidt  procedure  applied  to  the  columns  of  . 

Then,  remarkably, 

L(t)  zz  Q'^{i)L^Q{t) 

is  the  solution  of  the  Toda  lattice  equations  (5)  with  L(0)  =  Lq. 

Exercise:  Verify  this  by  direct  calculation. 

Observe  also  that  ==  g^(l)e^«Q(l)  Q^(l)Q(l)i?(l)Q(l)  i^{l)g(l). 

Thus  (the  exponential  of)  L{t^  at  time  t  =.  \  can  be  obtained  from  (the  exponential 
of)  Lq  by  simply  permuting  the  factors  Q{1)  and  i^(l)! 

At  this  point  Tomei,  Nanda,  L.C.  Li  and  I  became  interested  in  the  problem  from 
the  following  point  of  view:  the  Toda  lattice  in  fact  gives  an  algorithm  to  compute 
the  eigenvalues  of  a  matrix  Lq  (here,  for  the  moment  Lq  =  Lq  =  tridiagonal). 
Indeed,  solve  (5)  with  L(0)  =  Lq.  As  t 00,  bi{t)  0.  When  |6,(^)|  <  e  <  1,  the 
spectrum  of  L{t)  is  essentially  given  by  the  diagonal  elements  of  L{t),  and  hence 
by  isospectrality,  this  gives  the  eigenvalues  of  the  original  matrix  Lq!  Try  it  out  on 
your  hand  calculator:  it  works! 

Our  main  question,  however,  was  whether  this  Toda  algorithm  has  anything  to 
do  with,  and  perhaps  is  competitive  with,  the  standard  methods  for  computing  the 
eigenvalues  of  a  matrix.  Suppose  we  have  a  matrix  Lq.  How  do  we  compute  its 
eigenvalues?  In  98/100  cases,  if  you  go  to  a  computer  and  use  an  eigenvalue  package, 
it  will  be  based  on  the  so-called  QR  algorithm.  The  heart  of  the  algorithm,  which 
is  probably  the  most  successful  algorithm  in  numerical  analysis,  is  the  following. 
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Factor 

Lq  =  QqRq. 

Write 

Li  =  RoQo  =  QqLoQo  (so  that  spec(Li)  =  spec(io)) 

and  factor 

Li  =  QiRi. 

Write 

L2  =  RiQi  (so  that  spec(L2)  =  spec(Li)  =  spec(Lo)) 

and  factor 

L2  =  Q2R2 

etc.  In  this  way  we  produce  a  sequence  {Lk  }  of  isospectral  (in  fact  unitarily  equiv¬ 
alent)  matrices.  In  many,  many  cases  Lk  converges  as  — >  00  to  a  diagonal  matrix 
Loo ,  whose  diagonal  entries  must  therefore  give  the  eigenvalues  of  the  original  matrix 

lol’ 

Question:  does  this  method  have  anything  to  do  with  the  Toda  lattice? 

We  made  the  following  observation.  Let  Hqr{L)  =  tr(I/logL  —  L).  Through 
Flaschka’s  transformation,  in  the  tridiagonal  case,  this  is  a  function  on  Hence 
it  gives  rise  to  a  flow  Lq),  (t>QR{0,  Lq)  =  Lq. 

Then  we  showed  that 

<l)QR{k]  Lq)  =  Lk,  the  step  in  the  QR  algorithm. 

In  other  words,  we  proved  the  following  result. 

Stroboscope  Theorem  (tridiagonal  case) 

The  QR  algorithm  is  the  time  1  map  of  a  completely  integrable  Hamiltonian 
system  which  commutes  with  the  Toda  flow.  Furthermore  the  flow  can  be  solved 
explicitly  by  a  matrix  factorization. 

Note:  Hqr  —  X]  ■“  =  2  tr  =  2  ^  A^- .  Thus 

i=i  *=i 

{Hqr,Ht},  Aj}  =  0,  as  {Aj,Aj}=0. 

At  this  point  we  must  introduce  some  information  from  Lie  group  theory.  You 
cannot  tell  a  Lie  theorist  that  some  dynamic  is  taking  place  on  the  space  of  tridi¬ 
agonal  matrices  without  eliciting  the  following  response  (B.  Kostant,  -  79):  the 
tridiagonal  matrices  form  a  (minimal)  co-adjoint  orbit  of  the  action  of  the  lower 
triangular  group  C  on  its  dual  Lie  algebra  t ,  presented  as  the  symmetric  matrices. 

This  remark  opens  up  the  whole  game.  As  we  saw  in  Lecture  1,  dual  Lie  alge¬ 
bras  are  carriers  of  a  (Kostant-Kirillov)  Poisson  bracket  and  co-adjoint  orbits  are 
the  symplectic  leaves  of  this  Poisson  manifold.  In  particular  we  can  lift  the  Toda 
lattice  to  a  flow  on  general  symmetric  matrices.  So  we  can  take  ^  ^  tr  L  as  a 

Hamiltonian  on  t.  With  the  Kostant-Kirillov  bracket  this  generates  a  flow  on  gen¬ 
eral  symmetric  matrices  which  has  the  same  form  (Exercise)  as  in  the  tridiagonal 

i^  =  [BiL),L],  B(L)  =  L+-Ll. 

at 


case 
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n 

Remark.  The  push  forward,  under  Flaschka’s  transformation,  of  the  form  ^  dxiA 

i=i 

dyi  to  the  Jacobi  matrices  J,  induces  a  2-form  which  is  a  non-trivial  multiple  of  the 
2-form  that  J  inherits  as  a  co- adjoint  orbit  of  £.  This  explains  why  we  must  take 
rather  than  Ht. 

Now  the  QR  algorithm  also  applies  to  general  symmetric  matrices  Lq  (as  long 
as  detLo  /  0).  We  generate  as  before  Lq,  Li,  L2, . . . ,  Lk, . . . ,  with  Lk  Loo  ~ 
diagonal  as  Ar  00. 

Question:  Is  the  Stroboscope  Theorem  still  true? 

On  generic  co-adjoint  orbits,  which  have  dimension  2[n^/4]  (Exercise),  the 
answer  is  yes!  What  are  the  new  integrals?  The  eigenvalues  of  L  are  certainly  still 
conserved.  This  gives  ~  n  conserved  quantities.  But  we  need  ^  [^^/4]  —  n  more. 

Consider  a  4  x  4  example.  From  det(L  -  A)  =  0  we  obtain  four  eigenvalues 
^10)  ^20,  A30,  A40,  say.  Now  consider  the  3x3  matrix  (L  —  A)i ,  obtained  by  chopping 
the  first  row  and  last  column  from  L  —  \ 


Then  we  consider  det(T— A)i  =  0,  which  has  two  solutions  An,  A21.  Now  it  turns  out 

that  a  generic  co-adjoint  orbit  is  given  hy  <  L  =L^\  Ajo  =  const,  Y  =  const  \ 

I  i=i  i=i  J 

which  has  dimension  10-2,  which  equals  2 [4^/4]  =  8,  as  it  should.  Thus  we  need  four 
integrals.  Fortunately  An,  A21  turn  out  to  be  additional  integrals  for  the  flow  and 
moreover,  all  the  integrals  Aio,  A20}  A30,  A40,  An,  A21  Poisson  commute  with  each 
other.  Thus  we  may  take,  for  example,  the  four  independent  integrals  to  be  Aio,  A20, 
A30  and  An-  Again  the  associated  angles  are  the  first  components  of  the  associated, 
appropriately  normalized,  eigenvectors.  If  n  >  4,  then  we  consider  det(L  —  A)2  ==  0, 
where  (L  —  A)2  is  obtained  by  chopping  the  first  two  rows  and  the  last  two  columns 
from  L  —  A,  etc. 

Now  what  about  general  real  matrices  X  G  M(n,M)?  The  equation  ^  = 
[B(L),  L],  B  =  L4-— still  makes  sense  and  generates  a  global  flow  (as  B  ~  —B'^). 
We  need  a  symplectic  structure,  and  in  light  of  the  foregoing  discussion,  we  under¬ 
stand  that  what  we  really  need  is  a  Lie  group  and  its  dual  Lie  algebra.  Consider 
invertible  matrices  L,  L' .  Factor  L  =  QR,  V  =  R'  and  define  a  product 

LoV  =  QQ'R!R. 

It  turns  out  that  with  this  product  the  invertible  matrices  form  a  Lie  group  G  —  Gqr 
with  dual  Lie  algebra  Q  —  Qqu^  Generic  co-adjoint  orbits  have  dimension  ’n?  —  n 
and  if  we  look  at  the  flow  generated  by  |  tr  through  the  Kostant-Kirillov 

form  on  Gqti  we  again  get  the  generalized  Toda  flow  L  =  [B,L].  Also  the  QR 
algorithm  still  applies. 

Question:  Is  the  Stroboscope  Theorem  still  true? 
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On  generic  orbits,  the  answer  is  again  yes! 

We  now  need  (n^  -  n)/2  integrals.  Approximately  [n^/4]  intepais  can  be  ob¬ 
tained  by  chopping  L  -  A  as  before.  Now  consider  the  polynomial  in  two  variables, 

P[h,  z\  L)  =  det(L  +  hL^  —  z)  =  - 

It  turns  out  that  the  give  the  remaining  approximately  [71^ A]  integrals.  In 

other  words  the  whole  Riemann  surface 

P{h,z-,L)  =  0 

is  conserved  by  the  flow.  Moreover  the  flows  linearize  on  an  appropriately  singular- 
ized  Jacobi  variety  associated  with  the  curve.  Again  the  QR  algorithm  is  the  time  1 
map  of  the  flow  generated  by  =  tr(L  log  L-L)  through  the  Kostant-Kinllov 

structure  on 

Not  only  are  the  ideas  and  constructions  introduced  in  this  lecture  of  theoretical 
interest,  but  they  can  also  be  used  to  analyze  the  performance  of  eigenvalue  and 
singular  value  algorithms.  However,  that’s  another  story  .... 
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Lecture  3 

In  the  previous  lecture  we  described  how  the  QR  algorithm  can  be  viewed  as  the 
time  1  map  of  a  Hamiltonian  flow.  Moreover  the  associated  symplectic  structure  is 
the  Kostant-Kirillov  2-form  on  co- adjoint  orbits  of  a  specific  Lie  group  Gqr.  As 
a  set  Gqr^  the  Lie  algebra  of  Gqr,  is  simply  the  set  of  all  real  n  x  n 

matrices,  but  using  the  definition  of  the  multiplication  rule  on  Gqr^  it  is  easy  to 
see  (Exercise)  that  the  Lie  bracket  on  Gqr  is  given  by 

[M,  M']qr  =  niM']  -  [ttsM,  tt.M'] 

where  tt^M  =  -  Mj,  tt^M  =  (1  -  'n‘s)M  =  M_  -h  Mo  +  Mj.  Here  M_,  Mo,  M+ 

are  the  strictly  lower,  diagonal  and  strictly  upper  parts  of  M  respectively,  M  = 
M_  -f  Mo  +  M^,  and  [*,  •]  denotes  the  standard  commutator.  The  dual  Lie  algebra 
Gq^  can  be  identified  with  Gqr  through  the  non-degenerate  pairing 

(M,  N)  =  tr  MN. 

In  the  previous  lecture  we  mentioned  the  basic  “conjecture” ,  strongly  believed 
by  many  people,  that  every  integrable  system  is  an  isospectral  deformation  of  some 
associated  linear  operator.  There  is  a  more  refined  conjecture  that  I  now  want  to 
describe. 

Given  a  Lie  algebra  {G[‘,  •]),  a  linear  operator  R:  G  G  said  to  be  a  classical 
R~matrix  (reference  [STS])  if  the  bracket  on  G  given  by 

yU  =  ^([-^*>2/]  +  [®.  ^^2/]) 

is  again  a  Lie  bracket.  As  [•,  is  bilinear  and  skew,  the  only  issue  is  whether  it 
satisfies  the  Jacobi  identity.  Thus  if  i?  is  a  classical  i^-matrix  on  G  we  obtain  two 
Lie  algebras  {Gy  [*,  •])  and  {Gr  =  Gy  [-,  •]«),  and  two  associated  Lie-Poisson  brackets 
(see  Lecture  2), 

{FuF2}{a)  =  a{[dF,{a),dF2{a)]),  a  e  G* , 

{FuF2}R{a)  =  a{[dFi{a),dF2{a)]R),  aeG*R  =  G*. 

We  call  the  first  Lie  algebra/bracket  the  free  Lie  algebra/bracket  and  the  second 
the  interacting  Lie  algebra/bracket. 

An  interesting  and  useful  sufficient  condition  for  [•,  ']r  to  be  a  Lie  bracket  is  the 
so-called  modified  Yang-Baxter  equation  (mYB), 

(mYB)  [Rx,  Ry]  -  2R{[xy  y]R)  ~  -[x,  y],  for  all  x,y  ^G- 

Thus  if  R  satisfies  (mYB),  [-,  -jj?  is  a  Lie  bracket.  Now  it  is  a  fundamental  observa¬ 
tion  (Exercise)  the  (mYB)  can  be  reformulated,  equivalently,  as 

(6)  [{R±l)x,{R±l)y]  =  2{R±l)[x,y]R,  for  all  x,  t/ E  6. 

This  shows  immediately  that  R  gives  rise  to  a  (not  necessarily  direct)  decomposition 

ofg, 


G  =  G^  G~y 
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into  a  sum  of  subalgebras  Q±  =  {1  :t  R)Q .  Set  7r±  =  so  that 

7  =  7r^_  +  7r_ ,  R  —  1:^  — 

Exercise  1;  If  the  decomposition  is  direct,  then  7r±  are  complementary  projections. 
Conversely,  if  5  =  @  is  a  direct  decomposition  of  G  into  subalgebras  with 

associated  projections  7r±,  show  that  7?  =  —  7r_  solves  (mYB). 

Exercise  2:  For  Q  =  gl{n,R),  R  =  tti  -  tVs  solves  (mYB)  and  [•,  -Jk  is  precisely 
the  Lie  algebra  [•,  *]gB  above.  Here  5+  =  {lower  triangular  matrices}  and  Q-  = 
{skew  matrices}. 

Let  G  be  a  Lie  group  with  Lie  algebra  Q.  We  say  that  F:  G*  — )•  IR  is  Ad*- 
invariant  if 

F{Adla)  -  F{a) 

for  all  a  GG* ,g  ^  G. 

Example.  G  =  Gl(n,R),F{a)  =  ira^ ,  a  eG*  ^G-  We  showed  in  Lecture  1  that 
Ad*a  =  g'^^ag.  Hence  F{Ad*ga)  =  tT{g-'^agY  =  tr  g~'^a^g  =  U  =  F{a). 
Finally  we  consider  an  invariant,  nondegenerate  pairing  (*,  ■)  on  G:  i-©- 

{x,  [y,z])  +  {[x,y],z)  =  0,  for  all  x,y,z  GG- 

Exercise:  Show  that  {x,y)  =  ti  xy  is  an  invariant,  non-degenerate  pairing  on 
G  =  gl{n,R). 

It  is  straightforward  to  see  that  yld* -invariant  Hamiltonians  H  generate  trivial 
flows  through  {•,•},  but  in  the  interacting  structure  {t}/?,  H  gives  rise  to  the 
differential  equation 

(7)  ^  =  [v-dH(a),a] 

on  which  is  identified  with  Gr  =  Q,  through  the  invariant  pairing. 

Exercise:  For  77  |  dH{L)  =  L  and 

77,{dH{L))  =  tt,L  =  L+  -  =  B(L). 

This  verifies  that  H  generates  the  generalized  Toda  flow  ^  =  [B{L),L]  on  Gr  = 

^^e  are  interested  in  the  above  definitions  and  calculations  primarily  because  of 
the  following  basic  result. 

Theorem  1  Ad* -invariant  functions  Poisson  commute  on  (Gr,{‘^  ‘}r)' 

Theorem  2  Suppose  H  is  Ad* -invariant  and  that  the  decomposition  G  =  G+®G- 
zs  direct.  Let  oo  €  be  given.  Then  for  some  0  <T  =  T{ao)  <  00,  there  exists  a 
unique  decomposition 

for  0  <t  <T,  luhere  g±{t)  €  exp5±,  <^^d 

a(f)  = 

solves  (7)  for  0  <  f  <  T,  xvith  af(0)  =  oq. 
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Example.  In  the  QR  case, 

exp  5+  =  identity  component  of  the  group  of  lower  triangular,  invertible  matrices, 
expQ-  =  identity  component  of  the  orthogonal  matrices. 

Now  foT  H  =  ^  tr  dH{L)  =  L  and 

^tdH{Lo)  _  gtLo  _  g_^g_^  g_^  otthogonal,  lower  triangular, 

and  L{t)  =  =  g'^Log^.  This  is  exactly  the  Symes  procedure 

described  in  Lecture  2  (with  upper  triangularity  replaced  by  lower  triangularity) . 

Proof  of  Theorem  1,  Differentiating  Fi{Adlt^a)  =  Fi{a),  i  =  1,2,  with  respect 
to  f  at  t  =  0,  we  obtain  for  any  x  G  5,  dFi{a)])  =  0.  Thus 

{F,,F3}fi(a)  =  a{[dFi(a),dF2{a)]n)  =  ^a([RdFda),dF2{a)]) 

+^a{[dFi{a),RdF2{a)]) 

=  ^a([RdFi{a),dF2ia)])  -  ^a([RdF2{a),dFi{a)]) 

=  0,  by  the  invariance  of  F2  and  Fi. 

In  particular,  if  {Ai}  are  the  eigenvalues  of  M  G  M(n,M)  in  the  QR  case,  then  the 
{Aj}  are  clearly  Ad*-invariant  and  hence  {Ai,Ai}(M)  =  0,  which  shows  that  the 
eigenvalues  give  n  commuting  integrals  for  the  Toda  flow. 

Exercise:  Prove  Theorem  2  by  direct  calculation. 

We  are  now  ready  to  consider  infinite  dimensional  systems.  Of  course,  an 
infinite-dimensional  system,  the  KdV  equation,  was  responsible  for  all  the  develop¬ 
ments  that  we  have  been  discussing  so  far.  But  as  is  often  the  case,  the  motivating 
example  in  a  new  theory  is  often  not  the  simplest  and  most  basic  example.  Now  we 
know,  for  many  reasons,  some  of  which  will  become  clear  during  this  and  the  next 
lecture,  that  the  Nonlinear  Schrodinger  (NLS)  equation  is  the  simplest  example  of 
an  infinite  dimensional,  integrable  Hamiltonian  system  and  without  further  apolo¬ 
gies  I  am  going  to  restrict  my  attention  to  this  equation.  The  discovery  that  the 
NLS  equation  is  integrable  is  due  to  Zakharov  and  Shabat  [ZS]. 

The  NLS  equation  has  the  form 

iyt  +  Vxx  -  S/clypy  =  0, 

where  k  =  1  is  the  defocusing  case  and  k  =  —1  is  the  focusing  case.  Both  cases  are 
of  great  interest,  but  again,  without  any  further  apologies,  I  am  going  to  restrict 
myself  in  this  and  the  next  lecture  to  the  simpler  case  when  k  —  1,  i.e.  the  defocusing 
case.  So  from  now  on  we  only  consider 

iyt  +  yxx  -  2\y\^y  =  0. 


NLS 
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This  equation  can  be  written  in  generalized  Lax-pair  form  as 


-  \Q.P 


where 


P  =  -iza^  -{- 


Q  =  —2iz‘^cr3  -f  2z 


Exercise:  Check  this!  Also  verify  that 


-\v? 


where 


P  =  —iz(T3  —  i 


Q  =  ~-2iz^(T3  ~  2iz 


iyx  -\y\ 


is  the  generalized  Lax-pair  form  for  the  focusing  NLS  equation.  This  information 
will  be  needed  in  Dave  McLaughlin’s  lecture. 

Remarks 

Equation  (8)  is  slightly  different  from  the  Lax-pair  for  the  KdV  equation  de¬ 
scribed  in  Lecture  2.  Here  the  spectral  parameter  2:  is  present  in  P  and  in  Q  and, 
in  particular,  the  dependence  of  Q  on  2:  is  quadratic.  However,  arguing  as  before, 
we  see  that  for  any  fixed  2^, 


for  some  invertible  operator  U{i).  In  particular  the  spectrum  of  the  generalized 
eigenvalue  equation 


—il)  -  P{t,  z)i)  -  ~iza3  + 
ax  V 


is  conserved  by  the  NLS  equation. 

Observe  that  equation  (8)  can  be,  and  usually  is,  written  in  the  form 

which  is  the  so-called  zero- curvature  form  of  the  equation. 

Now  I  want  to  draw  your  attention  to  a  basic  difference  between  finite  and  infinite 
dimensional  systems.  That  is,  the  answer  to  a  given  infinite-dimensional  problem 
depends,  far  more  than  in  the  finite-dimensional  case,  on  the  context  of  the  problem. 
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For  example,  the  NLS  equation  can  be  thought  of  as  describing  solutions  which  are 
periodic  in  y{x,t)  =  y(x  +  l,t),  say,  or  on  the  other  hand,  solutions  which  decay 
as  \x\  oo,  y{x,t)  -)■  0  as  \x\  oo,  the  so-called  scattering  situation.  The  behavior 
of  the  system  is  radically  different  in  these  two  situations.  Moreover,  even  within 
a  given  context,  a  particular  question  can  have  different  answers,  depending  on 
the  detail  required.  For  example,  in  the  scattering  situation  for  NLS,  we  may  ask 
whether  y{x,t)  — >■  0  as  i  ->■  oo.  If  we  are  speaking  about  compacta  in  M,  the  answer 
is  yes,  but  considering  the  system  more  globally,  \y(x,t)\^dx  cannot  go  to  zero 
as  t  ->  oo:  in  fact,  it  must  be  constant. 

In  the  finite-dimensional  case,  there  is  usually  only  one  reasonable  answer  to  a 
question.  But  in  the  infinite-dimensional  case,  the  situation  is  far  more  subtle,  with 
different  answers  depending  on  the  analytical  nature  of  the  question.  These  remarks 
must  be  kept  in  mind,  in  particular,  when  we  try  to  apply  dynamical  system  methods 
to  PDE’s.  These  methods  give  us  a  guide  what  to  look  for,  but  now  each  question 
can  have  many  different  answers.  So,  again  without  any  apologies,  I  am  going  to 
restrict  my  attention  to  the  scattering  Ccise  for  the  defocusing  NLS  equation, 

y{x)  0  sufficiently  rapidly  as  |x|  ->  oo. 

The  Hamiltonian  character  of  the  defocusing  NLS  equation  in  the  scattering  situation 
is  discussed,  for  example,  in  [FT]. 

In  the  scattering  situation  (see,  for  example,  [BC]),  there  exists  for  all  z 
a  unique  solution  ^  of  the  eigenvalue  equation  with  the 

properties 

m  =  as  x  ^  -hoo, 

m(x,z)  is  bounded  as  x  — oo. 

Using  the  (generalized)  Lax-pair  form  for  NLS,  we  find,  as  in  the  case  of  Toda, 
that  =  <9^  -h  for  some  matrix  C.  Letting  x  ±oo,  and  using  the  given 
asymptotics  of  m,  we  obtain  C  =  2iz^(Ts.  Thus  ^  H-  2iz‘^'^crs. 

Now  it  turns  out  that  for  all  x  and  t, 

(x,  t,  z)  =  lim^(a;,  t,z±  is) 

eiO 

exist  for  all  ^  6  M.  Also  are  clearly  both  solutions  of  =  P^±:  hence  there 
exists  a  matrix  v  =  v{t^  z),  z  G  M,  such  that 

(9)  ^^{x,t,z)  =  ^.{x,t,z)v{t,z). 

For  obvious  reasons,  v  =  v{i,z)  is  called  the  jump  matrix  for  ^(a:,t,  •).  Differenti¬ 
ating  (9)  with  respect  to  t,  we  find  ^4.t  =  -h  ^-Vt,  i.e.  2iz^^^(T3  -|-  = 

2iz^^-(Tzv  -{-  Q^-V  -f-  which  implies 

(10)  vt  =  2iz^[v,<T3]. 

This  is,  explicitly,  the  linearization  of  the  flow\  Indeed  inverse  scattering  theory 
(see,  for  example,  [FT]  or  [BC])  tells  one  that  the  map  y  v  is  1-1  (and  onto,  for 
v  in  a  suitable  space).  Thus  the  v's  (or,  more  precisely,  the  r’s  from  which  the  v’s 
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are  constructed  —  see  below)  are  essentially  the  “angles”  for  the  problem.  Solving 

(10)  we  obtain 

where  v{z)  is  the  jump  matrix  for  the  scattering  problem  at  time  ^  =  0. 

One  shows  easily  (see  [FT]  or  [BC])  that  v  has  the  form 


l-|r(;.)P  -^r(z) 


for  some  function  r  with  ||r||Loo(M)  <  1.  If  y{x,t  =  0)  G  5(M),  the  Schwartz  space  on 
M,  then  also  r(z)  G  5(M).  And  conversely  if  r  G  S{M)  is  given  with  ||r||j,oo(]R)  <  1, 
inverse  scattering  theory  tells  one  that  there  exists  a  unique  function  y  E  S(M)  with 

/  I  —  |rp  —f\ 

associated  jump  matrix  v  =  v{z)  =  I  ^  ^  1 .  The  function  r  =  r{z)  is 

called  the  reflection  coefficient  for  the  scattering  problem  ^  -  P^. 

If  we  set  m  =  as  above,  then  from  we  obtain 


d  .  r  1  /  0  2/ 


Also  it  is  a  fact  (see  [FT]  or  [BC])  that,  for  fixed  x. 


m{x,z)  =  /  +  ^^^  +  0 


as  — >  oo. 


Substituting  this  asymptotic  expansion  into  the  above  differential  equation  for  m, 
we  obtain  i[(Tz,  mi]  “  ^  q  ^  j  which  establishes  the  basic  formula 


(11)  y{x)  =  2i{mi{x))i2. 

The  preceding  calculations  can  be  summarized  as  follows.  Let  yo{x)  G  S{M) 
be  given  and  let  r  =  r{z)  be  the  corresponding  reflection  coefficient.  Set  = 

Theorem  3  Let  m{z)  —  m{xfl,z)  solve  the  Riemann-Hilbert  (RH)  problem 

•  m  =  m(-)  =  m{x,t,  •)  is  a  2  x  2  matrix  valued  analytic  function  in  C\IR, 

•  m^{z)  =  m^{z)vx,t(^)^  ^ 

•  m{z)  — )■  7  as  z  oo. 

Then  if  m  =  7  -f  +  0  (^)  as  z  ->  oo, 

(12)  y{xfl)  ^2i{mi{xfl))i2 
solves  the  NLS  equation  with  initial  data  y{x,  0)  =  yo{x). 
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Thus  we  see  that  to  analyze  the  solution  of  NLS  we  must  solve  a  Riemann-Hilbert 
problem  and  control  the  solution  of  the  problem  as  x,t  become  large.  We  will  see 
how  to  do  this  in  the  next  lecture. 

Now  the  point  I  want  to  make  here  is  that  the  above  RH  formulation  should  be 
understood  in  the  classical  R-matrix  framework  (see  [FT]  for  details) .  Indeed  we 
saw  that  if  we  are  in  an  R-matrix  situation,  the  solution  of  the  equation  generated  by 
an  invariant  Hamiltonian  can  be  solved  by  a  factorization  problem.  For  example,  in 
the  case  of  the  Toda  lattice  on  the  abstract  factorization  problem  guaranteed 
by  the  R-matrix  formalism  takes  the  concrete  form  of  a  finite-dimensional  matrix 
factorization,  =  QR.  For  NLS,  as  we  see  above,  the  abstract  factorization  takes 
the  form  of  a  Riemann-Hilbert  factorization. 

This  theme,  I  believe,  is  a  unifying  concept  in  the  theory  of  integrable  systems. 
The  more  refined  conjecture  mentioned  above  is  the  following:  not  only  is  every 
integrable  system  integrable  because  there  is  an  associated  Lax-pair,  but  even  more, 
in  the  background,  there  is  an  R-matrix,  and  hence  a  factorization  problem,  whose 
solution  gives  the  solution  of  the  basic  dynamical  system. 

Remark  1.  The  first  person  to  recognize  that  an  inverse  scattering  problem  was 
equivalent  to  a  Riemann-Hilbert  problem  was  Shabat  [Sj. 

Remark  2.  In  the  infinite-dimensional  setting,  questions  arise  about  the  meaning 
of  “complete  integrability” .  In  the  finite-dimensional  case,  of  dimension  2n,  say,  we 
need  n  (commuting)  integrals  to  integrate  the  system.  In  the  infinite-dimensional 
case,  what  does  it  mean  to  have  “half”  the  integrals?  In  what  sense  can  we  add 
one  integral  in  or  take  one  out?  When  do  we  have  enough?  These  questions,  and 
more,  were  considered  in  the  case  of  KdV  with  periodic  boundary  conditions,  for 
example,  by  McKean  and  Trubowitz,  in  their  beautiful  paper  [MT].  In  the  scattering 
situation,  however,  these  are  in  general  only  a  finite  number  of  L^-eigenvalues,  and 
these  must  be  complemented  with  a  “continuum  of  integrals”  constructed  from 
scattering  theoretic  quantities  (see  [FT]).  In  these  four  lectures  we  unfortunately 
do  not  have  the  time  to  develop  these  elegant  considerations:  rather,  we  take  the 
practical  point  of  view  that  a(n  infinite-dimensional)  system  is  integrable  if  we  can 
construct,  by  hook  or  by  crook,  a  one-to-one  change  of  variables  which  linearizes 
the  flow.  For  example  for  NLS,  we  have 

y(x,t)  =  r^,t  =  logr^^^t  =  \ogr +  2i{2tz^  +  xz). 

Analyzing  the  inverse  map,  3/(®,<),  we  are  able  to  describe  the  behavior,  and 

in  particular,  the  asymptotic  behavior  of  the  flow. 
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Lecture  4 

The  Liouville-Arnold-Jost  Theorem  teaches  us  that  the  flow  generated  by  a  finite¬ 
dimensional  integrable  system  is  equivalent  to  a  straight  line  motion  on  a  cylinder 
jk  If  =  n,  say,  the  cylinder  is  a  torus  and  the  motion  is  almost  periodic 

in  time,  etc.  In  the  integrable,  infinite-dimensional  case,  we  still  have  straight  line 
motion  on  a  (now  infinite-dimensional)  cylinder,  but  when  we  invert  the  linearizing 
map,  the  implications  of  the  straight  line  motion  for  the  flow  in  physical  x-sp^ce  will 
depend  on  the  topology  we  impose  (see  the  discussion  in  the  middle  of  Lecture  3). 
Special  tools  are  needed  to  analyze  the  inverse  map  and  again,  without  apologies, 
we  restrict  our  attention  to  the  defocusing  NLS  equation. 

We  are  interested,  in  particular,  in  the  long-time  behavior  of  the  solution  y[xjt) 
of  NLS  with  initial  data  2/(x,0)  =  yQ(x)  e  «S(M).  By  Theorem  3,  this  reduces  to 
controlling  the  large  x^i  behavior  of  a  Riemann-Hilbert  problem.  Now  a  straight¬ 
forward  calculation  (Exercise)  shows  that  if  [|r||/,oo(]g^  1,  then 

y{x,i)  =  2i{mi{x,t))i2 

/oo  _ 

-OO 

and  the  long-time  behavior  of  y{x,t)  can  be  computed  to  leading  order  using  the 
standard  method  of  stationary  phase.  In  other  words,  if  we  think  functorially,  then 
there  exists  some  (highly  nonlinear)  functional  y{x,t)  =  .F(r(-)e*(^*(‘) 
which  reduces  when  r  is  small  to  the  ordinary  Fourier  transform,  to  which  the 
standard  methods  of  stationary  phase  then  apply. 

This  raises  the  following  question!  Does  there  exist  some  kind  of  “nonlinear” 
steepest-descent/stationary  phase  type  method  which  can  be  applied  directly  to 
(11),  even  when  is  not  small? 

More  generally,  we  are  looking  for  a  steepest-descent/stationary  phase-type 
method  that  applies  directly  to  oscillatory  Riemann-Hilbert  problems  (of  the  type 
occurring  in  Theorem  3  above,  for  example)  as  the  external  parameters  (x  and  t  in 
our  case)  become  large. 

What  features  should  the  method  have?  First  let  us  review  the  classical  method 
of  stationary  phase  to  determine  the  asymptotic  behavior  as  t  ->  oo  of  an  integral 
F{i)  =  on  a  contour  (7,  say,  where  f{z)  and  g{z)  are  analytic. 

Step  1  Locate  the  points  zj  of  stationary  phase,  g'{zj)  =  0. 

Step  2  Determine  paths  of  steep(est)-descent  in  the  neighborhood  of  the  Zj's:  as 
one  moves  along  these  paths  away  from  the  stationary  phase  points, 
should  decrease  exponentially. 

Step  3  Using  Cauchy’s  Theorem,  deform  C  to  coincide  with  these  paths  of  steep(est)- 
descent  near  the  Zj’s. 

Step  4  As  t  ->  oo,  the  problem  is  localized  to  the  part  of  the  deformed  contour 
that  lies  in  an  e-neighborhood  of  each  Zj]  the  contribution  from  the  remainder 
of  the  contour  is  exponentially  small. 

Step  5  The  localized  integrals  can  be  computed  explicitly:  indeed,  after  scaling 
^  ^  ^  -  Zj)y/t,  the  integrals  reduce  to  Gaussian  integrals  on  M.  Thus 

the  leading  asymptotics  of  F{t)  can  be  computed  explicitly. 
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Recently  Xin  Zhou  and  I  [DZl]  have  discovered  a  method  for  RH  problems  that 
is  the  analog  of  the  above  classical  method  for  integrals.  In  addition  to  the  NLS 
equation,  the  method  has  now  been  used  to  obtain  the  asymptotics  of  a  wide  variety 
of  problems  ranging  from  integrable  wave  equations  such  as  Modified  KdV  [DZl] 
and  KdV  [DVZl]  etc.,  to  “integrable”  ode’s  such  as  the  Painleve  equations  [DZ3], 
to  integrable  discrete  models  such  as  the  Toda  lattice  [K],  to  integrable  statistical 
mechanical  models  such  as  the  transverse  Ising  chain  at  the  critical  magnetic  field 
[DZ2]  and  also  the  so-called  XY  model  [DIZ2],  to  small  dispersion  problems  [DVZ2], 
and  finally,  most  recently,  to  the  theory  of  random  matrix  models  [DIZ2] . 

The  application  of  the  method  to  the  NLS  equation,  which  is  the  main  subject 
of  this  lecture,  was  developed  together  with  Alexander  Its  and  X.  Zhou  [DlZlj;  for 
a  more  detailed  exposition,  refer  to  [DZ4],  which  is  also  intended  as  a  pedagogic 
introduction  to  the  general  steepest-descent  method  for  RH  problems. 

So  how  does  the  method  work?  Set  0(z)  =  2z^-\-jz  .  Step  1  is  the  same.  Setting 
0\zq)  =  Azq  -f  ~  0,  we  obtain  the  stationary  phase  point  zq  —  —xjM.  To  describe 

the  analog  of  Step  3,  we  must  explain  what  it  means  to  “deform”  a  RH  problem. 
What  we  mean  is  that  we  can  replace  a  given  RH  problem  on  one  contour  by  an 
equivalent  RH  problem  on  another  contour  —  the  one  problem  is  solvable  if  and 
only  if  the  other  is  solvable. 

To  be  explicit,  we  consider  more  generally  RH  problems  on  oriented  contours 

ScC 


z 


with  given  jump  matrices  v:  E  — >  G/(n,  C).  If  E  is  unbounded,  we  always  assume 
v(z)  —y  I  sufficiently  rapidly  as  |2:|  ^  oo.  Recall  that  the  orientation  on  an  arc  is 
equivalent  to  specifying  the  ±  sides  of  the  arc,  e.g., 
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By  convention,  the  +  side  is  taken  to  lie  on  the  left  as  we  traverse  the  arc  in  the 
direction  of  the  orientation. 

Given  (E,  v)  we  seek  an  n  x  n  matrix- valued  function  m  which  is 

♦  analytic  in  C\S, 
and  satisfies, 

♦  m^(z)  =  m^{z)v(z)  for  z 

♦  m(z)  ->  /  as  oo. 

This  is  the  RH  problem  determined  by  (S,  v).  It  is  not  at  all  clear  that,  given  E  and 
V,  the  RH  problem  has  a  (unique)  solution,  and  indeed,  a  beautiful  and  profound 
mathematical  theory  has  been  developed  over  the  years  to  consider  such  questions 
(see,  e.g.,  [CG]).  Fortunately  our  RH  problem  in  Theorem  3  has  a  unique  solution 
(see  e.g.,  [DZ4]  for  details). 

Now  suppose  we  have  a  RH  problem  (E,  v) 

z 


and  suppose  that  there  are  two  distinguished  points  zi  and  Z2  on  E  which  support 
a  circular  disk  D  intersecting  E  as  follows, 


Suppose  that  within  the  disk  v{z)  has  a  factorization  v{z)  -  bj{z)vi{z)b^{z), 
z  G  E  n  D,  where  b±(z)  have  analytic  continuations  to  the  zb  segments  of  the  disk, 
respectively!  '^1(^)1  however,  has  no  analyticity  properties.  Let  m  be  the  solution  of 
the  RH  problem  determined  by  (E,!;). 

Define  m(z)  on  the  complement  of  the  extended  contour  E  =  E  U  dD,  as  follows: 
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m  =  m 


Observe  that 

•  mis  analytic  in  C\E , 

•  satisfies  m+  =  in^v  on  E, 

•  m(z)  — >■  /  as  ;2:  oo, 

where  v  is  defined  on  E  by 


The  RH  problem  corresponding  to  (E,i;)  is  the  deformed  RH  for  (E,  v);  clearly 
the  two  problems  are  equivalent  in  the  sense  stated  above. 

For  the  RH  problem,  what  do  we  mean  by  paths  of  steep (est)-descent  as  in 
Step  2?  Somehow  we  feel  that  the  main  contribution  to  the  solution  of  the  RH 
problem  should  be  coming  from  the  stationary  phase  point  Zq.  The  key  to  the 
problem  is  the  signature  table  of  Re  iO  =  Re  i{2z‘^  —  4zq  z).  In  the  z-plane  we  have 


Re  i0>O 

Re  i  0  <  0 

Re  i  6  <0 

Re  1 0  >  0 

What  we  would  like  is  to  “put”  the  factor  in  Vx,t  in  the  region  where 
Re  iO  <  0,  and  the  factor  in  the  region  where  Re  i9  >  0:  in  that  way  the  RH 
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problem  would  clearly  localize  to  a  neighborhood  of  >2:0,  as  i  00.  This  would  then 
be  the  RH  analog  of  Step  2.  But  how  can  we  do  it?  Algebraically  the  two  factors 
are  locked  together  in  the  same  expression.  The  baisic  observation  is  that  we  can 
factor 


(  1  -  |r(z)p  -re-2>-'« 
1 

=  bzH^. 


1  \  (  I  0  \ 

0  1  y  V  1  ) 


Assuming  for  the  moment  that  r[z)  has  an  analytic  continuation  to  Im  >  0,  and 
hence  r{z)  =  r[z)  has  an  analytic  continuation  to  Im  <  0,  we  see  that  for  z  >  zq 
in  particular,  this  gives  rise  to  a  deformed  RH  problem  in  the  sense  described  above 
(here  Z\  ZQyZ2  ^  00) , 


Clearly  this  is  what  we  want  as  h^[z)  I  for  2:  on  the  upper  leg  and  b^{z)  I  for 
2:  on  the  lower  leg,  as  i  ->  00  (see  signature  table  of  Re  iO).  But  if  we  try  the  same 
trick  for  2:  <  2^0 ,  we  obtain 


which  is  clearly  not  what  we  want  as  6^,61^  now  blow  up  exponentially.  What  we 
want  to  do  somehow  is  to  reverse  the  order  of  appearance  of  the  multipliers 
in  the  factorization  of  Vx,t{z).  Fortunately,  this  can  be  achieved  by  performing  a 
lower/upper,  as  opposed  to  an  upper/lower,  factorization  of 


1 

r  ^2ite 
l~|rpe 


0 

1 


(l-|r(z)p)  0 

0  (i-K^)l^)-' 


1  -r  -2ite  \ 

0  1  J 
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(Recall  that  <  1  so  that  1  —  |r(2:)p  ^  0.)  To  remove  the  diagonal  factor 

set 


where  S  is  an  as  yet  undetermined  function,  which  we  will  choose  below  to  be 
analytic  and  invertible  in  C  \  (— oo,  2:0]- 

Simple  algebra  shows  that  solves  a  RH  problem  on  =  E  =  M  with 


for  z  <  Zq. 


We  choose  5  to  solve  the  following  scalar  RH  problem, 

•  6[z)  is  analytic  in  C  \(— 00,  ^tq], 

•  <5+(2)  =  J_(2)(l  -  |r(2)p),z  <  Zo, 

=  J_(z),  Z>Zo 

•  8[z)  1  as  ^  00. 

Such  scalar  problems  can  be  solved  explicitly  by  formula. 


With  5  as  above,  takes  the  form 
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with  jump  matrices  as  indicated  above.  Referring  to  the  signature  table  for 
Re  iO,  we  see  that  all  the  jump  matrices  converge  expoi^ntially  to  the  identity, 
uniformly  on  the  complement  of  ^-neighborhoods  of  Zq  in 

What  if  r  is  not  analytic?  In  this  case,  for  we  factor  ~ 

r{z^t)  +  r\[z^t)^  where  r(-,t)  is  analytic  and  ri{z,t)  is  small  as  t  — >■  oo,  and  for 
z  <  Zq  factor  r(l  —  similarly.  Instead  of  (12),  this  leads  to  a 

factorization  where  vx  ->■  1  rapidly  as  t  oo.  Under  deformation 

this,  in  turn  leads  to  a  RH  problem  of  the  form 


But  as  Vx  I  rapidly,  we  again  end  up,  to  any  desired  order,  with  a  RH  problem 
on  a  cross  of  type  as  before.  Either  way,  for  any  e:  >  0,  the  RH  problem 

localizes  to  a  RH  problem  on  a  small  cross  of  size  s,  as  t  ^  oo 


This  completes  Steps  1,  2,  3  and  4. 

In  the  analog  of  Step  5,  we  again  scale  z  {z  --  zo)\/i  and  this  leads  to  a  RH 
problem  on  an  infinite  cross,  but  now  the  jump  matrices  have  constant  coefficients. 
At  this  stage  all  the  analytical  tricks  have  been  used  up  and  there  are  no  small 
parameters  left  to  scale.  Like  the  Gaussian  integral  that  you  just  have  to  compute 
in  the  classical,  scalar  case,  this  is  a  problem  you  just  have  to  solve:  either  the 
heavens  are  with  you  or  against  you.  Fortunately,  in  this  problem  they  are  with 
you  and  it  turns  out  that  this  local  RH  problem  can  be  solved  explicitly  in  terms 
of  parabolic  cylinder  functions.  We  thus  obtain  the  leading  order  behavior  of  the 
solution,  with  controlled  error  terms.  Furthermore,  the  method  is  algorithmic  and 
we  do  not  need  to  make  an  a  priori  ansatz  for  the  asymptotic  form  of  the  solution. 

The  kind  of  result  that  we  obtain  is  the  following. 

Let  Vasi^^yt)  =  zq  =  -xjAt,  where 


(^o)  = 


log(l  -  |r(«o)P)  >  0, 


(zo)P 
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argQf(zo)  =  -  /  Iog(;2ro  -5)c/(log(l  -  lr(5)p)) 

^  J -oo 

+  ^  +  argr(2V(2:o))  -  argr(2:o) 

(r  =  gamma  function) .  Fix  M  >  1. 

Theorem  4  Let  y{x,t)  solve  NLS  with  y{x,i  =  0)  t/o(a?)  G  S{U).  Then  as 

t  -A  oo 

(i)  y(x^  t)  =  ya^i^,  t)  O  for  \zq\  =r  |  — ^|  < 

(ii)  for  any  j, 


3/(a:.<)  =!/a..(«.<)  +  C>  +  ,  |zo|  > 

where  Cj{zQ)  decreases  rapidly  as  |zo|  -A  oo. 

Note  that  the  asymptotics  match  in  the  common  region  M~^  <  |  — ^|  <  M . 
Remarks 

(1)  The  asymptotic  form  of  the  solution  y  --  yas  for  NLS  was  first  obtained  by 
Zakharov  and  Manakov  [ZM],  using  a  so-called  “isomonodromic-type”  technique. 
The  calculations  in  [ZM],  however,  contain  no  error  bounds. 

(2)  The  scaled  RH  problem  arising  in  Step  5  above,  first  arose,  and  was  solved,  in 
the  work  of  Its  and  Novokshenov  (see  [IN]). 

The  steepest-descent/stationary  phase  type  method  presented  above  for  RH 
problems  is  a  kind  of  nonlinear  Fourier  integral  method,  and  we  may  reasonably 
raise  the  following  question:  do  the  analogs  of  singular  optical  phenomena,  such  as 
caustics,  occur  in  the  non-linear  problem?  The  NLS  equation  is  indeed  the  simplest 
case  because  there  are  no  caustics  i.e.  the  pair  of  equations 


has  no  solutions.  In  the  case  of  the  Modified  KdV  equation,  however,  there  is  a 
similar  RH  to  NLS,  but  now  6  is  replaced  by  ^MKdV  ~  T  For  this  problem 
there  are  indeed  caustics: 

£  (4.3  +  |,)  ^  0  z  =  ±zo  =  ±y-^,  r<0. 
i  (4^3  +  ^  0  ^  ^  ^  0 

Thus  ic/t  — )■  0  is  a  “caustic”  region,  and  indeed  there  are  two  main  regions  for 
the  asymptotic  behavior  of  MKdV  (see  [SA],  [DZl]).  In  the  first  region,  < 

\/~12i  <  M ,  X  <  0,  the  stationary  phase  points  ±zo  are  separated  and  do  not 
interact  to  leading  order  and  the  solution  is  a  sum  of  two  terms  of  the  same  type  as 
the  asymptotic  form  yas  occurring  in  NLS.  In  the  caustic  region,  |^|  <  c,  however, 
the  solution  y{x,i)  of  Modified  KdV  now  takes  a  new  and  different  form,  y 
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(j^)  where  u  -  u{s)  solves  the  Painleve  II  equation  'u"(s)  =  sw(s)  +  2u^{s). 
Again  there  are  overlapping  regions,  etc. 

A  problem  which  has  two  stationary  phase  points  but  no  caustic  region  arises 
in  evaluating  the  long-time  asymptotics  for  the  autocorrelation  function  x{i)  — 
(crJ(t)f7Q)T  at  temperature  T  for  the  transverse  Ising  chain  at  the  critical  mag¬ 
netic  field  with  Hamiltonian  H  =  -  +  ^1)-  PZ2],  Zhou  and  I  show 


that 


log  I  tanh(5/T)|(iif  +  0(logi) 


Now,  so  far,  we  have  presented  the  RH  method  as  a  nonlinear  analog  of  the 
classical  method  of  stationary  phase.  But  it  turns  out  that  there  are  genuinely 
nonlinear  phenomena  which  can  arise.  For  example  for  KdV,  which  is  closely 
related  to  the  Modified  KdV  equation,  a  third  region,  called  the  collisionless  shock 
region  and  discovered  by  Ablowitz  and  Segur  [AS],  appears  between  the  regions 
<  \Fwt  <  in  describing  the  long-time  behavior  of  the 

solution.  In  this  region,  which  is  described  by  the  inequalities  <  ti/3(iogt)V3'  < 
Cl,  the  solution  of  KdV  behaves  like  a  modulated  cnoidal  wave.  As  is  shown  in 
[DVZl],  this  region  arises  from  the  fact  that  the  full  line  joining  the  stationary 
phase  points 


-2: 


0 


2: 


0 


becomes  “electrified”  and  each  point  on  the  line  contributes  equally  to  the  long-time 
behavior  of  KdV.  Thus  in  the  steepest-descent /stationary  phase  method  for  RH 
problems  we  can  have  “lines  of  stationary-phase”,  and  not  just  points  of  stationary 
phase. 

Remarks  A  complete  description  of  the  leading  asymptotics  of  the  solution  of 
the  Cauchy  problem  for  KdV,  with  connection  formulae  between  different  regions 
(including  the  collisionless  shock  region,  which  is  their  discovery!)  was  presented 
by  Ablowitz  and  Segur  [AS],  but  without  precise  information  on  the  phase  (and 
without  error  estimates).  In  a  later  development,  Segur  and  Ablowitz  [SA]  used  a 
modification  of  the  method  in  [ZM]  to  derive  the  leading  asymptotics  of  the  solutions 
of  the  MKdV,  KdV  (but  not  in  the  collisionless  shock  region)  and  Sine-Gordon 
equations,  including  full  information  on  the  phase.  For  more  historical  information 
on  the  contributions  of  many  authors  to  the  calculation  of  the  asymptotic  behavior 
of  integrable  systems,  see,  for  example,  [DIZl]. 
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The  Complex  Ginzburg-Landau  Equation 
as  a  Model  Problem 

C.  David  Levermore  and  Marcel  Oliver 

Abstract.  The  generalized  complex  Ginzburg-Landau  (CGL)  equation  de¬ 
scribes  the  evolution  of  a  complex-valued  field  u  =  u{x^  t)  by 

dtu  =  -j-  (1  -I-  iz/)Au  -  (1  -f  . 

It  has  a  long  history  in  physics  as  a  generic  amplitude  equation  near  the  on¬ 
set  of  instabilities  that  lead  to  chaotic  dynamics  in  fluid  mechanical  systems, 
as  well  as  in  the  theory  of  phase  transitions  and  superconductivity.  In  this 
role  it  and  equations  like  it  have  had  remarkable  success  in  describing  evolu¬ 
tion  phenomena  in  a  broad  range  of  physical  systems,  from  fluids  to  optics 
More  recently,  it  has  been  proposed  and  studied  as  a  model  for  “turbulent” 
dynamics  in  nonlinear  partial  differential  equations.  It  is  a  particularly  inter¬ 
esting  model  in  this  respect  because  it  is  a  dissipative  version  of  the  nonlinear 
Schrodinger  equation,  a  Hamiltonian  equation  which  can  possess  solutions  that 
form  localized  singularities  in  finite  time. 

This  article  summarizes  the  status  of  well-posedness  and  regularity  ques¬ 
tions  for  the  CGL  equation  subject  to  periodic  boundary  conditions  in  any 
spatial  dimension.  It  also  discusses  the  relationship  of  these  results  to  the 
above  investigations  and,  in  particular,  to  analogous  issues  for  Navier-Stokes 
fluid  turbulence.  Appropriately  defined  weak  solutions  exist  globally  in  time 
and  unique  classical  solutions  are  found  locally.  Conditions  are  given  under 
which  these  classical  solutions  exist  globally  in  time  and  have  bounds  on  all 
derivatives  that  are  uniform  over  large  times,  thereby  providing  the  setting 
in  which  ideas  from  finite  dimensional  dynamical  systems  can  be  applied  to 
the  CGL  equation.  Moreover,  by  using  a  characterization  of  Gevrey  classes  in 
terms  of  decay  of  Fourier  coefficients,  these  solutions  are  shown  to  be  analytic 
for  positive  times.  Refinements  in  both  some  of  the  theorems  and  some  of  the 
proofs  are  introduced. 
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1.  Introduction 

The  description  of  chaotic  dynamics  or,  more  dramatically,  turbulence  in  con¬ 
tinuum  systems  by  nonlinear  evolution  equations  (often  partial  differential  equa¬ 
tions)  is  one  of  the  central  challenges  facing  theoretical  physics  and  applied  math¬ 
ematics.  We  would  like  to  be  able  to  apply  ideas  from  the  studies  of  dynamical 
systems  or  statistical  physics  to  these  evolution  equations  in  order  to  gain  a  deeper 
understanding  of  these  phenomena  and,  as  a  byproduct,  to  hopefully  achieve  sim¬ 
pler  or  more  tractable  descriptions.  One  of  the  main  mathematical  stumbling  blocks 
to  studying  these  questions  often  lies  in  our  lack  of  understanding  of  even  the  most 
basic  issues  of  existence,  uniqueness  and  regularity  for  solutions  of  the  underlying 
evolution  equations.  This  situation  has  led  mathematicians  and  physicists  either 
to  derive,  usually  through  systematic  asymptotics,  or  to  propose,  usually  in  an  ad 
hoc  manner  based  on  intuition  and  symmetries,  simpler  equations  for  which  these 
mathematical  issues  can  be  addressed  with  greater  success. 

One  such  equation  is  the  generalized  complex  Ginzburg-Landau  (CGL)  equa¬ 
tion,  which  describes  the  evolution  of  a  complex- valued  field  u  =  u{x,t)  over  a 
d-dimensional  spatial  domain  by 

(1.1)  dtu  =  Ru-\-  {l-\-  iu)Au  -  (1  +  2//) \u\'^'^u . 

Here  the  parameters  R  and  a  are  positive  while  u  and  fjb  are  real.  Without  loss  of 
generality,  the  equation  has  been  normalized  so  that  the  coefficients  of  the  linear 
and  nonlinear  dissipation  (damping)  terms  are  unity.  The  R  is  the  coefficient  of 
the  linear  driving  term,  without  which  all  solutions  would  decay  to  zero.  Clearly, 
a  sets  the  degree  of  the  nonlinearity.  The  u  and  /x  are  the  coefficients  of  the  linear 
and  nonlinear  dispersive  terms  respectively. 

The  CGL  equation  with  a  cubic  nonlinearity  (a  =  1)  has  a  long  history  as  a 
generic  amplitude  equation  derived  asymptotically  near  the  onset  of  instabilities  in 
fluid  dynamical  systems.  The  case  with  real  coefficients  was  first  derived  by  Newell 
and  Whitehead  [65]  and  Segel  [74]  to  describe  Benard  convection.  The  case  with 
complex  coefficients  was  put  forth  in  a  general  setting  by  Newell  and  Whitehead  [66] 
and  DiPrima,  Eckhaus,  and  Segel  [20],  and  was  shown  by  Stewartson  and  Stuart 
[76]  to  apply  to  plane  Poiseuille  flow.  The  reader  may  also  want  to  consult  more 
comprehensive  reviews  of  the  application  of  amplitude  equations,  such  as  the  CGL 
equation  to  chemical  reactions  [51]  and  to  pattern  formation  [17,  64].  Here  we 
remove  it  from  any  particular  physical  context  and  investigate  it  as  a  mathematical 
model  of  a  variety  of  possible  phenomena  in  nonlinear  partial  differential  equations. 
Considered  over  the  d-dimensional  torus  the  CGL  equation  is  both  theoretically 
and  numerically  tractable,  and  has  proved  fruitful  for  illustration  of  connections 
between  infinite  dimensional  and  finite  dimensional  dynamical  systems  [3,  24,  73]. 
We  shall  therefore  focus  our  study  on  the  spatially  periodic  CGL  equation  (1.1) 
and  point  out  those  results  that  extend  to 

This  lecture  will  present  to  nonspecialists  one  overview  of  how  the  generalized 
CGL  equation  can  be  viewed  as  a  dynamical  system.  We  have  therefore  tried  not 
to  presume  familiarity  with  material  beyond  that  from  standard  graduate  level 
analysis  and  functional  analysis  courses.  The  style  will  be  expository.  We  will  give 
a  big  picture  along  with  enough  detail  to  clarify  the  strategies  that  are  used  to  prove 
many  basic  theorems.  Without  any  pretense  of  being  exhaustive,  we  review  several 
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of  the  avenues  of  investigation  taken  in  the  literature,  all  of  which  have  application 
far  beyond  the  context  of  the  CGL  equation. 

The  material  is  organized  as  follows.  Section  2  begins  to  paint  the  CGL  phase 
portrait  by  identifying  some  special  solutions,  examining  their  stability,  and  de¬ 
scribing  how  chaotic  behavior  might  arise.  Section  3  presents  the  attributes  of  the 
CGL  equation  that  allow  it  to  be  a  focus  for  general  studies  of  turbulent  behavior. 
In  particular  its  relation  to  the  nonlinear  Schrodinger  (NLS)  equation, 

(1-2)  dtu  ~  iv/^u  —  ifjb\u\^^u  ^ 

is  discussed  as  an  analogue  to  the  relation  of  the  driven  Navier-Stokes  equations 
to  the  Euler  equations  in  fluid  dynamics.  The  existence  of  global  (in  time)  weak 
solutions  to  the  CGL  equation  for  LP'  initial  data  is  proved  in  Section  4.  Such 
solutions  exist  in  any  spatial  dimension  and  degree  of  nonlinearity.  The  result  is 
similar  in  spirit  to  that  of  Leray’s  existence  theorem  for  global  weak  solutions  to 
the  Navier-Stokes  equations.  Also  similarly,  the  question  of  uniqueness  for  weak 
solutions  remains  open.  Section  5  discusses  the  existence  and  uniqueness  of  local 
(in  time)  classical  solutions — even  solutions  provided  cr  is  a  positive  integer. 

Initial  data  in  the  classes  W  and  are  considered.  In  cases  when  s  can  be 

negative  this  will  include  initial  data  that  are  distributions.  Section  6  presents 
local  estimates  which  show  that  when  a  is  an  integer  the  solutions  are  not  only 
but  also  contained  in  a  subclass  of  the  real  analytic  functions.  The  derivation 
of  these  estimates  employs  a  characterization  of  Gevrey  classes  in  terms  of  decay 
of  Fourier  coefficients.  Our  inability  to  unconditionally  extend  the  local  solutions 
to  global  ones  is  a  manifestation  of  the  instability  in  the  inviscid  dynamics.  One 
cannot  generally  exclude  the  possibility  of  a  finite-time  blow-up,  even  in  dissipative 
systems.  Section  7  establishes  global  classical  solutions  through  cl  ‘pvioTi  bounds  of 
norms  and  does  not  restrict  a  to  integer  values.  When  <t  is  a  positive  integer, 
direct  global  bounds  on  every  norm  are  derived  in  Section  8.  The  role  of 
these  results  in  applying  dynamical  systems  methods  to  the  study  of  the  long-time 
behavior  of  CGL  solutions  is  discussed. 

2.  The  CGL  Equation  as  a  Model  for  the  Onset  of  Chaos 

This  investigation  has  two  main  themes.  The  first  theme  is  to  gain  as  complete 
an  understanding  as  possible  of  the  CGL  “phase  portrait”  in  any  infinite  dimen¬ 
sional  phase  space  in  which  the  flow  is  defined  globally  in  time.  The  second  theme  is 
to  justify  the  asymptotic  derivation  of  the  CGL  equation  as  an  amplitude  equation 
and  to  determine  whether  the  CGL  phase  portrait  can  be  related  to  the  onset  of 
chaos  in  solutions  of  the  original  dynamical  system.  This  question  will  be  taken  up 
in  the  lecture  of  A.  Mielke  [63]  and  is  also  reviewed  in  [17,  51],  so  it  will  not  be 
discussed  here. 

Crucial  to  the  understanding  of  the  phase  portrait  of  any  dynamical  system 
is  the  nature  of  the  long-time  asymptotics.  Our  main  focus  will  be  to  establish 
existence  and  properties  of  a  globally  attracting  set.  Such  a  set,  the  global  attractor, 
is  compact,  invariant  under  the  flow,  and  attracts  all  orbits  uniformly  over  bounded 
sets  of  initial  data  [29,  78].  Naturally,  the  attractor  contains  the  a;-limit  set  and 
the  unstable  manifold  of  the  u;-limit  set  of  every  orbit.  The  nature  of  orbits  on  and 
about  the  attractor  provides  valuable  information  for  understanding  the  mechanism 
for  the  onset  of  chaos. 
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Our  investigation  begins  by  identifying  classes  of  solutions  that  are  known  ex¬ 
plicitly  and  which  are  therefore  amenable  to  local  analysis  using  linear  and  nonlinear 
methods.  First,  the  number  of  linearly  unstable  directions  gives  a  lower  bound  on 
the  dimension  of  the  unstable  manifold,  thereby  providing  a  lower  bound  on  the 
dimension  of  the  attractor.  This  analysis  was  first  carried  out  in  detail  for  the 
cubic  CGL  equation  for  d  =  1  by  Doering,  Gibbon,  Holm  and  Nicolaenko  [24]  and 
for  d  =  1,  2  by  Ghidaglia  and  Heron  [35].  Second,  we  will  indicate  the  nonlinear 
techniques  used  to  paint  a  picture  of  the  bifurcations  from  these  solutions  as  R 
increases. 

To  illustrate  the  procedure,  we  first  linearize  the  CGL  equation  (1.1)  about  the 
zero  solution  and  obtain 

(2.1)  dtu  —  Ru-\- {I . 

Then  a  small  perturbation  with  a  pure  Fourier  mode  u{x,  t)  =  a{t)  exp(i^-a:)  where 
^  G  27rZ^  (=  (27rZ)^)  evolves  according  to 

(2.2)  dta  =  i?d  -  (1  +  . 

This  shows  that  with  respect  to  this  perturbation  the  zero  solution  is  linearly  un¬ 
stable  when  ICP  <  R,  linearly  asymptotically  stable  when  >  R,  and  linearly 
neutrally  stable  when  =  R,  Hence,  the  dimension  of  the  subspace  of  linear 
instability,  which  gives  a  lower  bound  on  the  dimension  of  the  globally  attracting 
set,  grows  like  as  R  becomes  large. 

The  simplest  class  of  nontrivial  spatially  periodic  solutions  is  comprised  of 
rotating  wave  solutions,  which  have  the  form 

(2.3)  u{x,  t)  =  a  exp  (z(C-a:  -  a;t)) , 
where  a  G  C,  u;  G  E,  and  ^  G  27rZ'^  satisfy 

(2.4)  =  = -R  -  1^ P  • 

The  largest  of  these,  corresponding  to  f  =  0,  is  the  spatially  homogeneous  solution, 
sometimes  referred  to  as  the  Stokes  solution.  For  each  wave  vector  ^  such  that 
If  P  <  i?,  formulas  (2.3)  and  (2.4)  define  a  circle  of  rotating  wave  solutions  that 
is  parameterized  by  the  phase  of  a.  These  solutions  that  play  an  important  role 
in  many  physical  applications  and  their  existence  is  one  of  the  reasons  that  the 
CGL  initial- value  problem  posed  in  ^^(T^)  is  more  interesting  than  that  posed  in 
L^(E‘^).  Notice  that,  as  R  increases,  the  circle  of  rotating  waves  corresponding  to 
wave  vector  f  bifurcates  off  the  zero  solution  at  =  |f  p,  thereby  corresponding  to 
the  addition  of  two  real  dimensions  to  the  subspace  of  linear  instability  of  the  zero 
solution  for  each  such  f  as  i?  increases  beyond  |f  p.  There  are  on  the  order  of 
such  circles  as  R  becomes  large. 

Due  to  the  simple  form  of  the  rotating  waves  (2.3),  their  linear  stability  can 
be  analyzed  exactly.  This  is  not  something  one  can  do  for  any  nontrivial  solution 
of  any  partial  differential  equation  because  the  associated  linearized  equation  will 
generally  have  coefficients  that  depend  on  both  space  and  time.  Indeed,  a  naive 
linearization  of  the  CGL  equation  (1.1)  about  a  rotating  wave  (2.3)  leads  to  a 
variable  coefficient  equation.  However,  if  one  introduces  a  perturbation  of  the  form 

(2.5)  u{x,  t)  —  a  exp  (i(f -x  -ojt)){l-\-e  u{x,  t)) , 
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then  the  resulting  linearization  is 

(2.6)  dtu  —  2(1  +  •  Vu  +  (1  +  iz/)Aw  —  (1  +  i^)cr\a\^^ [u  +  u*) . 

This  equation  can  be  viewed  as  a  real  linear  system  for  the  real  and  imaginary  parts 
of  u  (alternatively,  for  u  and  u*)  with  constant  coefficients,  which  can  therefore  be 
analyzed  using  Fourier  modes.  Due  to  the  multiplicative  form  of  the  perturbation 
(2.5),  the  Fourier  mode  of  u  with  wave  vector  ^  corresponds  to  the  Fourier  mode 
of  the  perturbation  with  wave  vector  ^  ^ .  Hence,  this  procedure  is  often  called  a 

side-band  analysis  and  ^  a  side-band  wave  vector. 

This  general  analysis  will  not  be  carried  out  here  (see  [24]  for  details),  but  some 
of  its  features  will  be  illustrated  by  analyzing  the  linear  stability  of  the  homogeneous 
solution.  In  that  case  (2.6)  simplifies  to 

(2-7)  dtU  =  (1  -h  ii')Au  -  (1  -h  ijji)aR{u  -h  u*) . 

When  it  is  written  as  a  real  linear  system  for  uji  and  uj,  the  real  and  imaginary 
parts  of  u,  this  equation  becomes 

^  '  '\uij  \uA-2ncrR  A  )\ui)' 

It  is  easily  checked  that  the  Fourier  mode  with  wave  vector  f  will  decay  exponen¬ 
tially  provided 

(2.9)  (1  +  t/2)|||4  ^  2aR{l  +  p  >  0 . 

This  will  certainly  be  satisfied  for  every  nonzero  ^ ,  no  matter  what  values  the  degree 
cr  and  forcing  R  may  have,  whenever  fj,  and  i/  lie  in  the  region  of  the  />ti/-plane  where 
>  0,  commonly  referred  to  as  the  modulationally  stable  region  (illustrated 
in  Figure  7,1  on  page  39). 

A  slightly  more  general  class  than  the  rotating  wave  solutions  are  the  harmonic 
(or  monochromatic)  solutions,  which  have  the  form 

(2.10)  u{x,  t)  —  a{t)  exp(zf -a::) , 
where  the  complex- valued  amplitude  a  =  a{t)  satisfies 

(2-11)  ^  =  {R-il  +  iv)\i\'^)a  -  (1  +  i^l)\a\^-a. 

Multiplying  this  equation  by  a*  and  separating  real  and  imaginary  parts  gives 
(2-12)  = 

It  is  easily  seen  that  when  72  —  p  <  0  these  solutions  decay  exponentially  to 
zero.  In  this  case,  these  solutions  are  part  of  the  stable  manifold  of  the  trivial 
solution.  When  72  —  =  0  these  solutions  decay  algebraically  to  zero,  reflecting 

the  marginal  stability  of  the  zero  solution  at  the  given  value  of  72  to  perturbations 
with  wave  vector  When  72  -  p  >  0  these  solutions  are  attracted  exponentially 
to  a  rotating  wave  solution  corresponding  to  the  wave  vector  In  this  case,  these 
solutions  lie  on  the  intersection  of  the  unstable  manifold  of  the  zero  solution  with 
the  stable  manifold  of  the  rotating  wave  solution  of  wave  vector  f , 
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The  rotating  waves  are  also  part  of  the  more  general  class  of  traveling  wave 
solutions,  each  of  which  has  the  form 

(2.14)  u[x,  t)  =  exp(-iat)  v{x  -  ct) , 

where  a  6  E  is  the  phase  frequency,  c  €  is  the  wave  velocity  and  v  =  v{z)  is  the 
complex-valued  profile  function  over  z  —  x  —  ct  that  satisfies 

(2.15)  {R  +  iQ)v  +  (1  +  -  c  •  VzV  -  (1  +  iij)\v\^^v  —  0  . 

Within  the  class  of  traveling  waves  are  the  so-called  standing  waves,  for  which  c  =  0, 
and  which  possess  a  nontrivial  spatial  structure  in  both  amplitude  and  phase.  A 
rotating  wave  (2.3)  will  solve  (2.15)  provided 

(2.16)  a;  =  Q;-|-^-c. 

More  general  traveling  wave  solutions  will  bifurcate  off  these  rotating  wave  solu¬ 
tions  as  R  increases  [77].  In  particular,  standing  waves  bifurcate  off  the  spatially 
homogeneous  solution  [67] .  Traveling  wave  solutions  have  also  been  shown  to  exist 
when  d  —  I  near  the  NLS  limit  [18].  However,  a  complete  understanding  of  all 
solutions  of  (2.15)  is  still  lacking. 

Stability  is  not  as  easy  to  assess  for  these  traveling  waves  as  it  was  for  the  ro¬ 
tating  waves.  For  example,  to  study  their  linear  stability,  introduce  a  perturbation 
of  the  form 

(2.17)  u{x,t)  =  exp(-iat)(t;(a:  -  ct)  -\-ev{x  ~  ct,t)) , 
into  the  CGL  equation  and  retain  only  terms  of  order  e  to  obtain 

dtv  =  (i?  +  ia)v  -h  (1  +  ii')AzV  —  c  • 

~  (f  +  ^A^)  —  cr(l  +  ^m)  |up^“^u(u*u  -t-  vv*)  . 

The  operator  on  the  right  side  is  independent  of  t,  so  the  linear  stability  analysis  is 
thereby  reduced  to  studying  the  semigroup  generated  by  that  operator,  which  is  not 
easy  because  the  operator  generally  has  coefficients  with  a  complicated  dependence 
on  2:. 

Harder  still  is  analytically  determining  the  bifurcation  sequence  of  solutions 
in  the  attractor.  Some  insight  into  these  questions  has  been  gained  through  nu¬ 
merical  studies  [57].  For  the  parameter  choices  studied  in  [57],  as  is  increased, 
the  standing  wave  solutions  undergo  supercritical  Hopf  bifurcations,  creating  two 
stable  periodic  orbits.  Further  increase  in  R  gradually  deforms  these  orbits  into 
a  pair  of  homoclinic  orbits  which  connect  the  homogeneous  solution  to  itself  (the 
homogeneous  solution  is  really  a  fixed  point  of  the  CGL  system).  When  this  ho¬ 
moclinic  orbit  is  broken,  a  single  stable  periodic  orbit  appears  which  looks  like  a 
combination  of  the  two  periodic  orbits  that  formed  the  homoclinic  orbit.  This  bi¬ 
furcation  sequence  is  known  as  a  “gluing  bifurcation”  (see  [33]  and  the  references 
therein);  it  is  not  a  period  doubling  bifurcation,  but  rather  a  length  doubling  that 
can  be  understood  in  a  three  dimensional  context.  At  this  homoclinic  connection,  a 
chaotic  set  is  not  created  because  of  the  stable  nature  of  the  periodic  orbits  involved: 
the  orbits  bring  their  attracting  neighborhoods  to  the  homoclinic  orbit,  which  is 
consequently  a  stable  object  in  the  phase  space,  and  therefore  cannot  produce  a 
hyperbolic  chaotic  set  of  trajectories  when  the  homoclinic  connection  is  broken  by 
a  change  in  R  [57].  As  R  is  increased  beyond  this  initial  bifurcation,  a  series  of 
unstable  homoclinic  orbits  occur  that  connect  the  homogeneous  solution  to  itself. 
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These  homoclinic  orbits  lead  to  chaotic  attractors  through  so-called  “homoclinic 
explosions”,  similar  to  those  found  in  the  Lorenz  equations. 

A  special  case  of  the  CGL  equation  for  which  the  attractor  can,  in  principle, 
be  completely  characterized  is  when  jj,  =  i/.  In  that  case  it  is  natural  to  introduce 
the  new  complex- valued  field  v  =  v{x,t)  by 

(2.19)  u{x^  t)  =  exp{—iRfjbt)  v{x,  t) . 

The  evolution  of  v  is  then  governed  by 

(2.20)  dtv  =  -{l+in)^, 

where 

(2.21)  =  + 

which  is  the  classical  variational  partial  derivative  with  respect  to  v*  of  the  so-called 
Ginzburg-Landau  functional  G  defined  by 


=  —At;  -h  —  Rv , 


G  =  [  \Vv\^  + -  R\v\^  dx . 


Computing  the  time  derivative  of  this  functional  under  the  flow  (2.20)  gives 


dG  f  t/vjr  _  W\jr  _  *  ,  ^  /  I 

(2.23)  — =  /  — 9tv  +  — fltv  dx  =  -2  —  dx, 

dt  Jjd  ov  ov*  Jjd  \  ov 

which  shows  that  G  is  a  monotonically  decreasing  function  of  time  except  at  points 
where  bG/6v*  vanishes.  Such  points  are  stationary  points  of  the  flow  (2.20)  and, 
hence,  are  standing  waves  of  the  CGL  flow.  They  include  as  a  special  case  the  circle 
of  stationary  harmonic  waves  given  by 

(2.24)  v{x)  =  bexp{i^'x) ,  =  i?  —  >  0 , 


(2.24) 


v{x)  =  bexp{i^’x) , 


for  some  6  6  C  and  f  €  27rZ^.  When  d  =  1,  the  stationary  points  generally  satisfy 
the  ordinary  differential  equation 

(2.25)  —dxxV  +  \v\'^^v  -  i?u  =  0 , 

so  their  existence  may  be  determined  through  a  phase  plane  analysis.  Furthermore, 
these  solutions  may  be  expressed  explicitly  in  terms  of  elliptic  functions  when  a  = 
1, 2  and  in  terms  of  hyperelliptic  functions  when  cr  >  2  is  an  integer. 

In  general,  (2.23)  shows  that  the  functional  G  formally  acts  as  a  global  Lya¬ 
punov  functional  for  the  flow,  a  fact  that  will  be  made  rigorous  once  sufficient 
regularity  is  established  to  justify  the  formal  manipulations.  This  will  be  done  in 
Section  5.  The  stability  of  any  stationary  point  v  can  be  examined  through  the 
spectrum  of  H(u),  the  Hessian  operator  of  the  functional  G  at  the  point  which 
is  given  by 


niv)  = 


'  6‘^G 

S^G 

6v*6v 

8v*6v* 

6^G 

S^G 

^  6v6v 

6v8v* 

-A  +  (<7 

+i)i«i 

-A  +  ((r  +  l)|vp-iE 
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The  Hessian  is  the  self-adjoint  linear  operator  associated  with  the  real  quadratic 
form  that  is  the  second  variation  of  G  at  i;.  Zero  is  always  in  the  spectrum  of 
H{v)  with  multiplicity  at  least  one  because  (iv,  is  always  the  null  vector 

associated  with  changing  by  a  constant  phase.  The  analysis  of  the  remaining 
spectrum  of  TUv)  is  generally  difficult  because  of  its  nonconstant  coefficients. 

In  the  case  when  is  a  stationary  harmonic  wave  (2.24)  the  Hessian  operator 

(2.26)  is  unitarily  similar  to  the  constant  coefficient  self-adjoint  operator 


(2.27)  H{v)  = 


-A  -  2ie  •  V  -F 


-A-h2i^  V4-a|6|2^ 


whose  spectrum  can  be  analyzed  through  Fourier  analysis.  For  example,  the  di¬ 
mension  of  the  unstable  manifold  of  the  circle  of  stationary  harmonic  waves  (2.24) 
is  given  by  the  number  of  nonzero  ^  E  27rZ‘^  such  that 


(2.28)  ll|"  +  2c7|6p-|||2-4(^.O"<0, 


assuming  there  are  no  such  ^  for  which  the  left  side  of  this  equation  vanishes.  In 
particular,  the  circle  of  stationary  harmonic  waves  (2.24)  will  be  stable  whenever 
<aR. 

When  d  =  1  and  a  ==  1  the  unstable  manifold  of  each  stationary  point  de¬ 
termined  by  (2.25)  can  also  be  analyzed  through  the  Hessian  (2.26)  by  using  the 
machinery  of  the  integrable  NLS  equation  [44].  However,  in  all  other  cases  our 
knowledge  of  the  phase  portrait  for  even  this  simplest  of  all  CGL  equations  (2.20) 
is  far  from  complete. 


3.  The  CGL  Equation  as  a  Model  for  Turbulent  Phenomena 

Much  work  on  the  theory  of  turbulent  phenomena  has  focused  on  simple  flu¬ 
ids,  where  the  phenomena  are  presumably  described  by  the  incompressible  Navier- 
Stokes  equations  [5].  One  of  the  major  obstacles  in  this  area  of  study  is  that  fact 
that,  as  of  this  writing,  the  existence  of  smooth,  or  classical,  or  strong  solutions  to 
the  Navier-Stokes  initial  value  problem  in  three  spatial  dimensions  has  never  been 
demonstrated  for  arbitrarily  large  (albeit  smooth)  initial  data,  or  arbitrarily  strong 
(albeit  smooth)  applied  forces  [14].  Leray  established  the  existence,  although  not 
the  uniqueness,  of  weak  solutions  to  the  Navier-Stokes  equations  over  half  a  cen¬ 
tury  ago  [55],  and  interpreted  the  implied  loss  of  regularity  in  the  general  case 
as  an  indication  of  the  onset  of  turbulent  behavior.  This  would  mean  that  vis¬ 
cous  dissipation  in  three  spatial  dimensions  is  unable  to  inhibit  singularities  driven 
by  instabilities  in  the  underlying  inviscid  system,  the  Euler  equations.  There  is, 
however,  an  obvious  logical  gap  associated  with  connecting  our  inability  to  prove 
regularity  and  uniqueness  with  an  actual  failure  of  those  properties.  Moreover,  the 
necessity  of  a  breakdown  of  regularity  in  the  Euler  equations  remains  an  open  area 
of  investigation  which  is  discussed  in  the  lecture  by  P.  Constantin  [12].  Uniqueness 
of  these  weak  solutions  is  not  established,  and  this  fact  carries  its  own  implica¬ 
tions  for  the  question  of  the  self  consistency  of  any  hydrodynamic  description  at 
all!  These  issues  remain  today  in  an  unsatisfactory  state  of  nonresolution. 

There  are  two  key  attributes  of  the  CGL  equation  that  allow  us  to  seriously 
consider  it  as  a  worthwhile  focus  for  general  studies  of  turbulent  behavior.  First, 
in  the  absence  of  driving  {R  =  0)  it  has  an  inviscid  limit  that  is  a  Hamiltonian 
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system,  namely,  when  fxv  is  the  nonlinear  Schrodinger  (NLS)  equation 

(3.1)  idiU  = ^ 


which  is  essentially  the  usual  Schrodinger  equation  of  quantum  mechanics  with  its 
potential  V  replaced  by  .  Second,  this  inviscid  limit  possesses  a  self-focusing 
mechanism  capable  of  provoking  spatially  localized  singularities  in  the  solutions, 
within  a  finite  time,  starting  from  arbitrarily  smooth  initial  data.  This  effect  can 
be  thought  of  as  a  rather  violent  cascade  process  whereby  energy  concentrated  in  low 
wavenumber  modes  is  rapidly  transported  to  high  wavenumber  modes.  These  two 
properties  are  in  direct  analogy  to  the  problem  of  incompressible  fluid  turbulence,  at 
least  so  far  as  the  conventional  wisdom  is  concerned.  The  Euler  equations  constitute 
the  underlying  Hamiltonian  dynamics  of  the  dissipative  Navier-Stokes  equations, 
and  instabilities  in  the  inviscid  equations  can  create  a  cascade  of  energy  between 
widely  separated  length  scales.  We  will  examine  these  two  attributes  in  more  detail 
below. 

The  NLS  equation  can  be  naturally  recast  as  a  Hamiltonian  system  in  the  form 

(3.2)  iudtu=^, 
where  the  Hamiltonian  H  is  given  by 

(3.3)  H=  f  dx . 

Jjd  <7  -f-  1 

Defining  the  Poisson  bracket  of  any  two  functionals  F  and  G  to  be 


(3.4) 


iv  Jjd  \  6u  6u* 


6F  6G\  ^ 
Su*  6u)  ^  ’ 


the  evolution  of  any  functional  F  under  the  NLS  flow  (3.2)  is  governed  by 


(3.5)  ^  = 

At  least  formally,  the  Hamiltonian  H  is  conserved  by  the  NLS  flow.  Besides  H,  the 
NLS  flow  also  generally  conserves  the  mass  M  and  the  momentum  P,  which  are 
given  by 


(3.6)  M  =  f  |wp  dx ,  P  =  ^  f  (u*Vu  —  uVu*)  dx . 

Jrd  2^  Jjd  ^ 

These  quantities  play  an  important  role  in  the  general  study  of  the  NLS  equation, 
but  such  a  digression  would  take  us  to  far  afield.  However,  it  should  be  noted  that 
the  one-dimensional  cubic  NLS  equation  (d  =  cr  =  1)  is  completely  integrable  and 
therefore  possesses  an  infinite  number  of  conserved  quantities  in  involution  with 
respect  to  the  Poisson  bracket  (3.4)  [82,  83]. 

Ginibre  and  Velo  [36]  showed  that  the  nonlinear  Schrodinger  equation  is  a 
locally  well-posed  initial- value  problem  in  the  Hilbert  space  (M^)  provided 

(3.7)  d  <  2  +  -  . 

(7 

More  precisely,  if  (3.7)  is  satisfied  then  for  every  p>  0  there  exists  a  time  T{p)  >  0 
such  that  for  every  initial  data  6  (R^^)  with  p  there  exists  a  unique 

u  e  C([0,T],H^(E‘^))  satisfying  a  mild  form  of  the  NLS  initial-value  problem. 
Moreover,  the  quantities  M,  P,  and  H  are  all  conserved  by  this  solution.  Extending 
this  result  to  is  not  direct,  and  requires  some  different  ideas.  Recently,  Bourgain 
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has  developed  new  techniques  that  allow  the  NLS  dynamics  over  periodic  domains 
to  be  extended  to  larger  spaces  [8,  9],  but  here  we  shall  confine  ourselves  to 
dynamics.  We  will  leave  imprecise  the  exact  nature  of  what  it  means  to  be  a  mild 
solution  of  the  NLS  initial-value  problem — it  is  not  important  for  the  following 
considerations.  However,  it  is  important  to  understand  that  whenever  (3.7)  is 
satisfied,  the  NLS  equation  will  make  sense  as  a  dynamical  system  in  at  least 
locally  in  time,  with  M,  P  and  H  conserved. 

Condition  (3.7)  arises  from  the  need  to  control  the  contribution  of  the  nonlinear 
term  of  the  NLS  equation  to  the  Hamiltonian  H  (or  equivalently,  to  control  the 
norm  of  u)  with  the  norm  of  u,  which  we  take  to  be 


1 


where  is  the  largest  negative  eigenvalue  of  the  Laplacian.  This  control  is 
achieved  through  a  Gagliardo-Nirenberg  type  of  Sobolev  inequality  [2,  32,  68], 
which  states  that  there  exists  a  positive  nondimensional  constant  C{a,  d)  <  oo  such 
that  the  norm  of  u  can  be  bounded  by  a  geometric  average  of  the  and 

norms  of  u  as 


1 


whenever  the  geometric  interpolation  parameter  0  satisfies 

(3.10) 

The  above  value  of  0  is  determined  by  balancing  units  of  length  in  (3.9),  thereby 
rendering  (7(<j,  d)  nondimensional,  which  leads  to  the  relation 

(3.11)  + 

The  inequality  in  (3.10)  is  equivalent  to  condition  (3.7). 

These  local  solutions  can  be  extended  to  be  global  solutions  if  one  can  find  an 
upper  bound  on  ||u||/i-i  that  is  uniform  in  time.  When  fiu  >  0  such  a  bound  follows 
directly  from  the  estimate 

(3.12)  v‘^\\u\\%<H  +  u^k^M. 

On  the  other  hand,  when  /«/  <  0  the  Gagliardo-Nirenberg  inequality  (3.9)  yields 
the  estimate 

H  +  v'^k^M  =  llullti  -I-  \\u\\%t+2 


>  II«IIh> 


C{a,  d) 


l(2cr+2)0  u,,{2a+2)(l-e) 


where  0  is  given  by  (3.10).  Because  ||ul|£,2  is  conserved,  the  above  estimate  gives  a 
uniform  upper  bound  for  ||u||ifi  whenever  the  degree  of  l|u||/fi  in  the  second  term  is 
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less  that  in  the  first  term,  which  is  the  case  if  and  only  \i  ad  <  2.  For  the  marginal 
case  crd  =  2,  the  above  estimate  gives  a  uniform  upper  bound  for  \\u\\h^  provided 
\\u\\l2  is  small  enough. 

The  above  argument  shows  that  necessary  conditions  for  the  possible  formation 
of  finite-time  singularities  are  ufj,  <  0  and  ad  >2.  This  is  simply  understood  by 
considering  the  NLS  equation  to  be  a  “mean  field”  quantum  mechanical  descrip¬ 
tion  of  locally  interacting  particles.  Without  loss  of  generality,  taking  >  0,  the 
potential  in  (3.1)  is  .  When  >  0  the  interaction  is  repulsive,  and 

consequently  there  is  no  tendency  for  the  particles  to  concentrate;  this  is  referred 
to  as  the  defocusing  case.  On  the  other  hand,  when  //  <  0  the  interaction  is  at¬ 
tractive,  providing  the  so-called  self-focusing  mechanism,  and  the  particles  tend  to 
concentrate;  this  is  referred  to  as  the  focusing  case.  When  ad  <2  this  attraction 
is  not  strong  enough  that  it  can  induce  a  collapse  of  the  matter.  However,  as  will 
be  argued  below,  when  ad  >2  this  attraction  is  strong  enough  that  collapse  may 
occur.  The  term  critical  indicates  that  the  condition  for  blow-up  is  satisfied  as 
an  equality,  ad  =  2.  The  terms  supercritical  and  suhcritical  indicate  ad  >  2  and 
ad  <2  respectively. 

That  the  self-focusing  mechanism  can  produce  finite  time  blow-up  for  solutions 
of  the  NLS  equation  when  ad  >2  was  first  established  over  [38,  81,  84]  by  the 
following  argument.  Given  any  u  e  consider  the  vector- valued  quantity  x 

and  scalar- valued  quantity  I  defined  by 


IwP  dx 


which,  by  analogy  with  classical  mechanics,  can  be  interpreted  as  the  center  of  mass 
and  the  moment  of  inertia  of  \u\^  respectively.  These  quantities  will  be  finite  for 
any  u  ^  (E‘^)  for  which 


[  \xf\u\‘^dx<oo 

jRd 


Moreover,  I  will  be  positive  for  any  nonzero  u  G  Now  consider  any  initial 

data  e  which  satisfies  (3.15).  When  (3.7)  is  satisfied  it  can  be  shown 

that,  so  long  as  the  corresponding  solution  u{t)  of  the  NLS  equation  exists, 
condition  (3.15)  remains  satisfied.  The  quantities  x  and  I  of  (3.14)  can  then  be 
differentiated  and  written  in  terms  of  the  conserved  quantities  M,  P  and  H  defined 
in  (3.6)  and  (3.3),  so  that 


dl  u  [  .  .  ,  . 


ad 
~2  ~ 


If  we  assume  that  i/p  <0  and  ad  >2,  which  are  necessary  conditions  for  a  possible 
singularity  to  develop,  then 

d?I  1P|2 
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This  inequality  can  be  integrated  to  obtain  the  bound 
(3.18)  m  <  7(0)  +  ^(0)  i  1  (if  - 

It  is  clear  that  if  is  chosen  so  that  H  —  \P\‘^ /M  is  negative  then  I{t)  would  have 
to  become  zero  at  a  finite  time  T*,  in  contradiction  with  (3.14),  which  shows  it  to 
be  positive.  The  solution  must  therefore  develop  a  singularity  no  later  than  the 
time  T*,  an  upper  bound  that  could  be  far  larger  than  the  actual  blow-up  time.  By 
exploiting  the  local  nature  of  the  self-focusing  mechanism,  this  argument  has  been 
extended  to  bounded  domains  for  various  kinds  of  boundary  conditions  (including 
periodic)  in  [48]  by  introducing  a  cut-off  version  of  the  \x  -  x\^  weight  in  (3.14). 

The  nature  of  the  NTS  singularity  has  been  studied  extensively  both  asymp¬ 
totically  and  numerically  [53,  54,  71],  and  is  understood  to  behave  self-similarly 
in  the  supercritical  case  and  nearly  so  in  the  critical  case.  In  this  sense,  our  knowl¬ 
edge  about  the  possible  breakdown  of  classical  solutions  is  much  better  for  the 
NLS  equation  than  for  the  three-dimensional  Euler  equations,  where  it  is  not 
even  known  whether  or  not  singularities  develop.  Numerical  simulations  show 
that  the  self-focusing  mechanism  persists  in  the  CGL  equation  inside  the  mod- 
ulationally  unstable  regime  1  -h  <  0  and  when  ad  >  2  is  capable  of  provoking 
many  spatially  localized  near  NLS-singularities  in  solutions,  starting  from  arbitrar¬ 
ily  smooth  initial  data.  Each  near  singularity  produces  a  rather  violent  cascade 
process  whereby  energy  concentrated  in  low  wavenumber  modes  is  rapidly  trans¬ 
ported  to  high  wavenumber  modes,  after  which  it  is  damped  by  the  CGL  dissipation 
and  then  is  dispersed  in  a  somewhat  less  violent  inverse  cascade  process  whereby 
some  undissipated  energy  returns  from  high  wavenumber  modes  to  low  wavenumber 
modes  [39,  75].  Admittedly  the  mechanisms  by  which  energy  transfer  is  generated 
for  the  CGL  equation  (self-focusing)  is  very  different  than  that  for  the  3-dimensional 
Euler  equations  (vortex  stretching),  and  we  make  no  claims  as  to  a  direct  physi¬ 
cal  analogy.  Self-focusing  blow-up  in  the  NLS  equation  plays  a  more  direct  role 
in  describing  strong  turbulence  in  plasma  (the  NLS  equation  is  a  limit  of  the  Za¬ 
kharov  equations  [40,  81]),  but  dissipative  mechanisms  in  plasma  do  not  directly 
correspond  to  the  dissipative  terms  in  the  CGL  equation.  Rather,  we  regard  the 
CGL  equation  as  a  convenient  nonlinear  dissipative  partial  differential  equation 
displaying  intrinsic  inviscid  instabilities  that  we  may  exploit  to  test  and  hone  our 
mathematical  tools. 

Finally,  there  are  several  distinct  advantages  to  considering  the  CGL  equation 
rather  than,  say,  the  Navier-Stokes  equations  or  the  Zakharov  equations.  First, 
there  is  the  obvious  point  that  there  are  simply  fewer  dependent  variables  in  the 
CGL  equation,  so  we  may  in  general  expect  the  analysis  to  be  simpler.  Indeed, 
the  basic  techniques  that  have  been  developed  for  the  study  of  semilinear  parabolic 
equations  [80],  and  the  NLS  equation  over  [10,  24,  79]  and,  just  recently,  over 
[8,  9],  can  be  applied  to  the  CGL  equation.  Second,  the  self-focusing  mechanism 
and  its  resulting  finite  time  blow-up  for  solutions  of  the  NLS  equation  is  quite  well 
understood  both  physically  and  mathematically  (also  see  [54,  61,  62,  71]  and 
references  therein).  Third,  from  a  numerical  viewpoint  it  is  much  faster  and  more 
convenient  to  run  simulations  in  low  spatial  dimensions,  and  the  self-focusing  blow¬ 
up  behavior  is  already  present  in  the  1-dimensional  NLS  equation  for  a  >  2. 
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4.  Global  Weak  Solutions 

For  many  nonlinear  evolution  equations  of  mathematical  physics,  classical  so¬ 
lutions  are  either  known  to  break  down  or  not  known  to  exist  after  a  finite  time. 
The  best  example  of  the  former  case  is  the  formation  of  shocks  in  solutions  of  the 
compressible  Euler  equations  of  gas  dynamics,  while  the  best  example  of  the  later 
case  is  the  open  problem  as  to  whether  temporally  global  classical  solutions  exist 
for  the  incompressible  Navier-Stokes  equations  of  hydrodynamics.  One  approach 
around  this  problem  is  to  enlarge  the  notion  of  a  solution  to  that  of  a  so-called  weak 
solution  by  only  requiring  that  the  equation  be  satisfied  in  a  “weak”  sense,  namely, 
after  integrating  it  against  a  member  of  a  prescribed  class  of  test  functions.  In  a 
seminal  paper  of  1934,  Leray  [55]  proved  the  existence  of  a  temporally  global  weak 
solution  to  the  incompressible  Navier-Stokes  equations  over  the  whole  space  for 
any  initial  data  with  finite  energy.  Subsequently,  the  strategy  used  by  Leray  has 
been  applied  to  prove  the  existence  of  global  weak  solutions  for  a  number  of  other 
equations,  most  notably  for  the  Boltzmann  equation  by  DiPerna  and  Lions  [19]. 
This  section  gives  the  analogue  of  this  result  for  the  CGL  equation  (1.1)  over  T^, 
the  d-dimensional  periodic  box. 

The  technical  setup  and  notation  for  this  section  is  as  follows.  Throughout  this 
section  it  is  more  convenient  to  use  7  =  <7  -f  1.  We  consider  the  evolution  of  a 
complex  valued  field  u  =  u{t,x)  governed  by  the  generalized  CGL  equation 

(4.1)  dfU  =  Ru  +  (1  -h  iv)I\u  —  (1  +  i^i) , 
with  initial  condition 

(4.2)  u(0,x)  =  u^’^(a:) . 

As  before,  we  assume  i?  >  0  and  7  >  1  (u  >  0),  while  v  and  can  take  any  real 
value.  This  arbitrariness  contrasts  sharply  with  the  theory  of  classical  solutions 
presented  in  later  sections  where  the  admissible  parameter  values  of  7,  u,  and  // 
and  the  dimension  d  are  interrelated. 

Without  loss  of  generality  the  units  of  length  may  be  chosen  so  that 

(4.3)  /  dx  =  l. 

We  employ  the  notation 

(4.4)  {f)=  [  fdx, 

Jjd 

which  by  (4.3)  denotes  the  mean  value  of  /  over  the  torus. 

In  order  to  state  the  main  result  of  this  section  we  must  first  introduce  the 
principal  spaces  involved.  For  1  <  p  <  00,  the  classical  Lebesgue  L^-space  over 
(T^^,^^;)  will  be  denoted  Lp(T^).  The  Sobolev  space  of  functions  in  L^(T^)  with 
partial  derivatives  in  will  be  denoted  Lr^(T^).  Given  any  Banach  space  X 

with  norm  ||  •  ||x  and  1  <  p  <  00,  the  space  of  (equivalence  classes  of)  measur¬ 
able  functions  v  =  v(t)  from  [0, 00)  into  X  such  that  ||u||x  6  I/P([0,  T])  for  every 
T  >  0  will  be  denoted  ^^^^([O,  oo),X).  And  finally  (^([0, 00),  w-L^(T^))  will  de¬ 
note  the  space  of  continuous  functions  from  [0,oo)  into  w-Z/^(T‘^),  which  denotes 
L^(T^)  equipped  with  its  weak  topology.  This  means  that  v  G  G([0, 00),  w-L^(T^)) 
if  for  every  -0  e  L^(T^)  the  function  t  1-^  is  in  (7([0, 00))  endowed  with 

the  usual  topology  of  uniform  convergence  over  compact  intervals.  We  remark 
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that  L^^([0,  oo),X)  and  ^([0,  oo),  w-L^(T^))  are  Prechet  spaces  rather  than  Ba¬ 
nach  spaces.  As  such,  their  topologies  are  completely  determined  by  the  class  of 
convergent  sequences. 

The  main  result  of  this  section,  established  in  [25],  is  the  following  global 
existence  theorem  for  initial  data;  it  is  the  CGL  analogue  of  Leray’s  result  for 
the  Navier-Stokes  equation. 


Theorem  4,1.  Given  e  there  exists  a  function 

(4.5)  n  e  C([0,^),w-i^(T-'))  nLL([0,oo),i7i(T‘'))  n4';([0,oo),L2nT")) , 

that  satisfies  the  initial  condition  (4.2)  and  the  weak  form  of  the  CGL  equation 
(4.1); 

0  —  {'lp*u{t2))  —  {'llj*u{ti))  ~  ^  f 

(4.6) 

+  f  ((l+iu)'Vil>* -Vujdt' +  f  +  , 

Jti  Jti 

for  every  [^1,^2]  C  [0, 00)  and  every  test  function 'll;  G  Moreover,  it  satisfies 

the  energy  relation 


f\\vu\\%dt'+  f 

Jo  Jo 


\u\t^,di' 


<i\\u'X^+r[  Ml.dt', 

Jo 


for  every  t  G  [0, 00) . 


Remark  4.1.  The  notion  of  weak  solution  given  above  is  stronger  in  two  ways 
than  the  notion  of  weak  solution  in  the  sense  of  distributions.  Recall  that  n  is  a 
weak  solution  of  the  CGL  equation  in  the  sense  of  distributions  when  it  satisfies 
the  equation  one  obtains  formally  by  integrating  (4.1)  over  [0, 00)  x  against  a 
test  function  w  =  w{t,x)  e  (^^([0, 00)  x  T^)  and  integrating  by  parts  so  as  to  put 
all  the  derivatives  onto  the  test  function.  It  can  be  seen  directly  that  if  u  satisfies 

(4.6)  then  it  satisfies  the  distribution  form  of  the  CGL  equation  for  test  functions  in 
the  factored  form  w  =  (l>{t)'ip*  {x)  where  (f)  G  (^^([0, 00)  and  'll;  G  (7°°(T^).  Indeed, 
set  ti  ~  0  in  (4.6),  multiply  by  ^*(^(^2)  and  integrate  t2  over  [0, 00),  and  integrate 
by  parts  to  put  the  spatial  derivatives  onto  As  linear  combinations  of  factored 
test  functions  are  dense  in  (^^([0, 00)  x  T^),  it  is  clear  that  (4.6)  alone  implies  that 
u  is  a  weak  solution  of  the  CGL  equation  in  the  sense  of  distributions.  In  addition 
to  (4.6),  the  weak  solutions  given  in  Theorem  4.1  are  required  to  satisfy  the  energy 
relation  (4.7). 

Remark  4.2,  The  strategy  of  the  proof  follows  that  introduced  by  Leray  in  the 
context  of  the  Navier-Stokes  equations,  as  well  as  to  that  of  many  other  existence 
proofs  for  weak  solutions  of  other  equations.  For  this  reason,  the  proof  will  be  given 
in  sufficient  detail  so  as  to  clearly  illustrate  this  strategy.  Roughly,  the  idea  is  to 
construct  a  sequence  of  solutions  to  equations  that  approximate  (4.6),  then  show  the 
sequence  is  relatively  compact  in  a  topology  that  is  strong  enough  to  allow  us  pass 
from  the  approximate  equation  to  the  limit  (4.6)  for  any  converging  subsequence. 
This  involves  striking  a  balance  between  the  facts  that  compactness  is  easier  to 
establish  for  weaker  topologies,  while  convergence  is  easier  to  prove  in  stronger 
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topologies.  Uniqueness  can  never  be  asserted  by  such  a  compactness  argument, 
but  often  requires  the  knowledge  of  additional  regularity  of  the  solution.  Since 
we  do  not  limit  ourselves  to  any  particular  spatial  dimension,  or  to  any  particular 
nonlinearity,  the  results  of  this  section  are  equally  applicable  to  the  supercritical, 
critical,  and  subcritical  CGL  equations. 


Proof,  The  proof  proceeds  in  four  distinct  steps. 


Step  1,  Construct  a  family  of  approximate  solutions  constructed  by  any 
method  that  yields  a  consistent  weak  formulation  and  an  energy  relation— for  exam¬ 
ple,  the  Fourier-Galerkin  method.  Let  denote  the  -orthogonal  projection  onto 
the  span  of  all  Fourier  modes  of  wave  vectors  ^  with  |^|  <  1/e.  Define 
and  let  =  'Ue(i)  he  the  unique  solution  of  the  ordinary  differential  initial-value 
problem 


(4,8) 


dtUe  =  RUe  +  (1  +  iiy)AUe  -  (1  +  ifi)Pe{\Ue\^^'^~'^'^Ue)  , 
U,{0)  =  <  €  PeL^{T^)  C  C^(T^) , 


The  regularized  initial  data  u^^{x)  converges  to  strongly  in  L^(T^)  as  e  tends  to 
zero  and  is  chosen  so  that  ||Ug"||^2  <  ||w^”||l2. 

Furthermore,  these  solutions  will  satisfy  the  regularized  version  of  the  weak 
form  (4,6)  given  by 


(4.9) 


0  =  {rne{t2))  -  irueih))  -  R  r  {ruf)  dt' 

Jtl 

dti  Ju 


for  every  [^1,^2]  C  [0, 00)  and  -0  G  C°°(T^).  Here  =  P^'iIj  will  converge  to  ip  in 
as  e  tends  to  zero.  Moreover,  these  solutions  will  satisfy  the  regularized  version 
the  energy  relation  (4,7)  as  the  equality 


(4.10) 
for  every  t  G  [0, 00 


Jo  Jo 


-  ilKIIL 


f 


\Ue\\i2dt'  . 


Step  1  follows  from  the  standard  Picard  local  existence  theory  for  ordinary 
differential  equations  applied  to  (4.8)  as  posed  in  the  finite  dimensional  space 
PeL^(T^),  and  the  fact  that  (4.10)  provides  a  global  bound  on  the  solutions, 
ensuring  that  they  are  global.  Hence,  we  omit  all  details  of  the  proof  and  proceed 
under  the  premise  of  Step  1. 


Step  2.  Show  that  the  sequence  is  a  relatively  compact  set  (has  compact 
closure)  in 

(4.11)  C{[0, 00),  w-L2(T‘'))  a  w-Lf„,([0, 00),  A  w-ifj,([0,  oo),  L^-r{T^)) . 

Remark  4,3.  Here  the  notation  “A”  indicates  the  intersection  equipped  with 
the  weak  topology  induced  by  the  inclusion  maps;  this  means  that  a  sequence  in  the 
intersection  is  convergent  if  and  only  if  it  converges  in  each  space  separately.  The 
sense  of  convergence  for  these  spaces  individually  is  as  follows.  We  have  Vn  ^  v  in 
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w-Aoc([07  oo),iJ^(T^))  when  for  every  T  >  0  and  every  'll;  e  oo), 

we  have 

(4.12)  f  •yvn)dt^  [  {'il;*v-\-S/'ip*  ^Vv)  dt. 

Jo  Jo 

Similarly,  we  have  Vn  v  in  w-Z/fj^([0,  oo),  L^'^(T^))  when  for  every  T  >  0  and 
every  C  ([0,  oo),  (T^))  we  have 

(4.13)  f  {'il;*Vn)  dt 

Jo 

where  (27)*  =  27/(27  -  1).  Finally,  we  have  Vn  v  in  (7([0, 00),  w-L^(T'^))  when 
for  every  -0  G  I/^(T^)  we  have 

(4.14) 

uniformly  on  compact  subsets  of  [0, 00) . 

Proof  of  Step  2.  Equation  (4.10),  along  with  ||Ug”||/,2  <  implies 

that 

(4.15)  IK(i)iii2<iikiii2+i^ 
from  which  the  Gronwall  Lemma  then  gives 

(4.16)  ll«e(i)lli.<IKIli.e=>«‘. 

Inserting  this  into  the  right  side  of  the  regularized  energy  relation  (4.10)  yields  the 
explicit  bound,  uniform  in  e: 

(4.17)  UMOWl.  +  f  llVwelli.  dt'  +  f  dt'  <  . 

Jo  Jo 

This  bound  establishes  that  the  sequence  {tie}  is  contained  in  compact  sets  of  both 
w-Lfo^.([0, 00),  Lf^(T^))  and  w-Lfj^([0, 00),  I/^'^(T‘^)),  because  norm  bounded  sets  are 
relatively  compact  in  weak-*  topologies,  which  are  the  same  as  the  weak  topologies 
on  these  reflexive  spaces.  For  the  same  reason,  the  uniform  bound  (4.16)  also  shows 
that  {lie(t)}  is  a  relatively  compact  set  in  w~L^(T^)  for  every  t  >  0. 

In  order  to  complete  Step  2  it  must  be  shown  that  {ue}  is  a  relatively  compact 
set  in  (7([0,  oc),  w-L^(T^)).  Compactness  requires  more  than  just  boundedness 
here  because  of  the  strong  topology  over  t.  We  appeal  to  the  Arzela-Ascoli  theorem 
[7,  72]  which  asserts  that  {wg}  is  a  relatively  compact  set  in  (7([0, 00),  w-L^(T‘^)) 
if  and  only  if 

(i)  {ue{t)}  is  a  relatively  compact  set  in  w-L‘^{T^)  for  every  t  >  0; 

(ii)  {tig}  is  equicontinuous  in  (7([0, 00),  w-L^(T^)). 

As  was  noted  at  the  end  of  the  last  paragraph,  condition  (i)  is  satisfied.  In  order 
to  establish  (ii),  we  must  show  for  every  ip  G  ^^(T^^) 

(ii')  {{'ip*u^)}  is  equicontinuous  in  C([0, 00)). 

This  is  done  by  first  using  the  regularized  weak  form  (4.9)  of  the  CGL  equation  to 
establish  (ii')  for  ip  in  and  then  using  a  density  argument  to  extend  (ii')  to  the 
general  case  of  ip  in  L^(T^)  (see  [25]).  □ 

Step  3.  Show  the  sequence  {u^}  is  relatively  compact  in  LfQ^([0, 00),  L^(T'^)) 
and  Lf/{“^([0,  oo),I/^^“^(T^))  considered  with  their  usual  strong  topologies. 
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Remark  4.4.  This  step  is  necessary  because  Step  2  allows  us  to  assert  the 
existence  of  a  weakly  convergent  subsequence  of  {We}?  say  with  a  limit  w,  from 
which  we  may  conclude  only  that 


yT  pT 

/  ||u||^2  dt  <  liminf  /  ||tte|li2  dt 

Jo  Jo 


However,  the  argument  that  recovers  (4.7)  from  (4.10)  will  need 


Nli,2dt  =  lim 


which  requires  strong  convergence.  Similarly,  the  argument  that  recovers  the 
nonlinear  term  in  (4.6)  from  that  in  (4.9)  will  require  strong  convergence. 

Proof  of  Step  3.  The  crucial  point  is  to  use  the  results  of  Step  2  along  with 
the  following  embedding  lemma,  which  is  essentially  due  to  Leray. 


Lemma  4.2.  The  injection 

(4.20)  C([0,cx>),w-L=*(T''))  A  w-LL([0,cx)),ifi(T‘'))  LL([0, 

is  continuous. 


This  lemma  is  proved  by  demonstrating  that  sequences  that  converge  to  zero 
in  both  (7([0,  oo),  w-L^(T^))  and  w-Li^^{[0^oo),  will  also  converge  in  the 

space  oo),  The  proof  is  fairly  direct,  so  we  refer  to  [25]  for  details. 

The  key  tool  used  is  the  Rellich  Theorem,  which  states  that  ^  compactly. 

Given  the  lemma,  the  portion  of  the  assertion  of  Step  3  is  argued  as  follows. 
Step  2  states  that  {ue}  is  a  relatively  compact  set  in  both  C([0,  oo), w-L^(T^)) 
and  w-LfQ^.([0,  oo),  if^(T^)),  and  because  the  continuous  image  of  a  compact  set 
is  compact,  it  follows  that  {ue}  is  a  relatively  compact  set  in  L^q^([0,  oo),  L^(T'^)). 
Therefore,  any  subsequence  of  {u^}  that  converges  in  both  (^([0,  oo),  w-L^(T^))  and 
H^(T‘^))  will  be  strongly  convergent  in  L^q^([0,  oo),  L^(T^)). 

The  portion  of  the  assertion  of  Step  3  is  a  direct  consequence  of  the 

portion.  We  consider  two  cases.  First,  if  7  <  3/2  (so  that  27  —  1  <  2)  then  the 
injection 

(4.21)  iL([0, «)),  l2(t‘^))  ^  Lf;-1([0, 00),  L^y-\r)) , 

is  continuous,  so  that  strong  convergence  in  Lf^^([0, 00),  L^(T^))  also  gives  strong 
convergence  in  L^J^“^([0, 00),  L^^~^(T^)),  as  was  asserted.  Second,  if  7  >  3/2  (so 
that  27  —  1  >  2)  then  the  strong  convergence  in  00),  L^(T^))  and  the  weak 

convergence  in  Lfj^([0, 00),  L^'>'(T^))  combine  in  a  standard  interpolation  argument 
to  yield  the  result  [25] .  □ 

Step  4.  Go  to  the  limit.  That  is,  the  weak  solution  u  in  Theorem  4^1  is  identi¬ 
fied  as  the  limit  of  a  convergent  subsequence  of  {ue}.  The  fact  that  this  subsequence 
converges  in  the  various  function  spaces  is  used  to  verify  the  weak  form  (4.6)  and 
energy  relation  (4.7). 

Proof  of  Step  4.  Step  2  ensures  that  there  is  a  subsequence  of  {tte},  which 
we  also  refer  to  as  {tte},  that  simultaneously  converges  to  a  limit  u  in  the  spaces 
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C{[0,<x>),w-L\T%  w-Lf„,([0,oo),//i(T‘'))  and  w-Lfj,([0,oo),L27(Trf)).  Thus,  as 
was  asserted  in  (4.5), 

(4.22)  u  e  CCIO,  00),  w-L2(T^))  n  lL([0,  oo),  H1(T‘*))  n  ifj,([0, 00),  L27(T<i)) . 

Step  3  then  implies  the  strong  convergence  of  Ue  to  u  in  both  oo),L^(T‘^)) 

andLfr'  ([0,  oo),  ^(T^)).  All  that  remains  is  to  show  that  the  limit  u  satisfies 
the  weak  form  of  the  CGL  equation  (4.6)  as  well  as  the  energy  relation  (4.7).  Toward 
this  end  we  check  convergence  of  each  term  in  the  respective  regularized  versions, 
(4.9)  and  (4.10). 

First  consider  the  regularized  weak  form  of  the  CGL  equation  (4.9)  for  an 
arbitrary  test  function  '0  6  C°°(T^)  and  interval  [^1,^2]  C  [0, 00): 


(4.23) 


0  =  {rueih))  -  (rneih))  ~  R  r  (rUe)  dt' 

Jtl 

+  f\il  +  iiy)Vip*-Vu,)dt'+  /'((l  +  j/i)^;iu,|2(T'-i)t,,)dt'. 
Jtl  Jtl 


The  convergence  of  to  u  in  (7([0, 00),  w-L^(T^))  means  (see  (4.14))  that  (V?*Wg(t)) 
converges  to  {'ip*u{t))  uniformly  over  [^1,^2],  whereby 

(4.24)  irueih))  (ruih)) ,  {rMt2))  -  {ruih)) , 


f  {ip*u^)  dt'  [  {tp*u)  dt' . 

Jtl  Jtl 


When  combined  with  (4.25),  the  convergence  of  in  w-  LL([0,oo),i/i(T‘^))  (see 
(4.12))  then  yields 

(4.26)  f  \{1  +  •  Vtze)  dt'  •  Vw)  dt' . 

Jtl  Jtl 

In  order  to  pass  to  the  limit  in  the  nonlinear  term  of  (4.23)  we  shall  exploit 
the  generally  useful  fact  that  if  is  a  weakly  convergent  sequence  in  a  Ba¬ 
nach  space  X  and  {/„}  is  a  strongly  convergent  sequence  in  the  dual  space  X* 
then  the  sequence  {fnivn)}  is  convergent  in  C.  Here,  the  strong  convergence 
of  Uj  in  Lfj“^([0,oo),  j^2j  implies  the  weak  convergence  of  in 

and,  hence,  in  the  Banach  space  L^([ti,t2]  x  T^)).  The  fact 
that  converges  to  'ip  uniformly  over  implies  that  it  converges  strongly  in 
L°°([^i5^2]  X  T"^))}  which  is  the  dual  of  L^{[ti,t2]  x  T'^)).  It  follows  that 

(4.27)  f  {{l-\-ifi)'ip*\ue\‘^^^~^^u^)dt'  ^  f  {{l-\-ifj,)'ip*\uf^^~'^'^u)dt' . 

Jtl  Jtl 

By  combining  (4.25)-(4.27),  one  can  pass  to  the  limit  in  each  term  of  (4.23)  and 
conclude  that  u  satisfies  the  weak  form  of  the  CGL  equation  (4.6). 

To  recover  the  energy  relation  (4.7)  consider  its  regularized  version  (4.10): 


Jo  Jo 


=  iHX^+R 


Jo 
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First  examining  the  right  side,  the  strong  convergence  of  the  initial  data  in 
implies 

(4-29)  ||<||i,  \\u^X^ , 

while  the  strong  convergence  of  Ue  in  oo),  implies 


Jo  Jo 


The  right  side  of  (4.28)  is  therefore  convergent  as  e  tends  to  zero.  Turning  to  the 
left  side  of  (4.28),  the  convergence  of  in  (7([0,  oo),  w-L^(T‘^)),  together  with  the 
general  fact  that  the  norm  of  the  weak  limit  of  a  sequence  is  a  lower  bound  for  the 
limit  inferior  of  the  norms,  yields 

(4.31)  \\u{t)fi,2  <  hM\\\2  • 


Similarly,  the  convergence  in  w-iL([0,oo),fri(T‘*))  and  w-Lf;<^([0,oo),L27(Trf)) 
implies 


/  dt'  <  Uminf  f  |K||^i  dt' , 

^0  Jo 

f  dt'  <  Uminf  f  \\u,\\%  dt' . 

Jo  ^-"0  Jo 


Taken  together,  (4.30)  and  (4.32)  give 


f  l|Vw||^2  dt'  <  liminf  [  ||Vwe||^2  dt' . 
Jo  ^0 


By  combining  (4.31),  (4.33)  and  (4.34),  one  obtains  a  lower  bound  on  the  on  the 
limit  inferior  of  (4.28).  The  energy  relation  (4.7)  is  thereby  satisfied  and  Theorem 
4.1  is  proved.  □ 

Remark  4.5.  The  energy  inequality  (4.7)  in  Theorem  4.1  can  be  strengthened 
slightly.  Indeed,  it  is  readily  seen  that  the  approximate  solutions  defined  by  (4.8) 
of  Step  1  will  satisfy  the  energy  relation 


mwi.  +  r  \\vu,\\%  dt  +  r  \M%  dt 

Jti  Jti 


=  ill«.(ii)ll 


l'^e|jjr,2  dt , 


for  every  [^1,^2]  C  [0, 00).  The  convergence  of  to  u  in  00),  ^^(T^))  implies 

||iAe||x,2  converges  to  \\u\\i2  in  ^^^^([0, 00)),  whereby,  upon  also  employing  the  Cantor 
diagonal  method,  one  can  pass  to  a  subsequence  of  such  that 

(4.36)  \\u^{t)\\^2  I|w(t)|fy2  for  almost  every  t  e  [0, 00) . 

Call  the  set  of  points  for  which  the  limit  holds  E.  When  ti  is  restricted  to  lie  in  E 
one  can  argue  starting  from  (4.35)  as  in  (4.28-4.34)  to  show  that  u  satisfies 


Hmi2+  r  \\vu\\i,  dt+  r\\u\\%dt 

Jti  jti 


<hMh)\f^2+R  f\\u\\l,dt 

Ju 
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for  every  [^1,^2]  C  [0,oo)  with  t\  €  E. 

Remark  4.6.  Replacing  with  and  with  the  compactly  sup¬ 

ported  test  functions  C^{R^)  everywhere  in  the  statement  of  Theorem  4.1  gives 
the  analogous  result.  The  strategy  for  its  proof  is  embodied  in  the  same  four 
steps  used  above  for  the  periodic  case.  Indeed,  Step  2  and  Step  4  have  only  su¬ 
perficial  differences  from  those  above.  The  Fourier- Galer kin  method  used  in  Step  1 
to  construct  approximate  classical  solutions  breaks  down  over  the  whole  space,  but 
can  be  replaced  by  so-called  smoothing  approximations.  Specifically,  a  contraction 
mapping  argument  such  as  in  [36],  can  be  used  to  construct  smooth  solutions  of 
the  initial- value  problem 

dtU^  =  RUe  +  (1  +  iu)Au^  -  (1  +  '*  (be  *  je  *  l^e)  , 

Ue(0,  X)  =  Uf{x)  =  je  *  u''^{x)  , 

where  the  regularization  is  achieved  through  convolution  with  a  smooth,  nonnega¬ 
tive  mollifier  j^.  Finally,  details  of  Step  3  must  be  modified  to  reflect  differences  in 
the  Rellich  Theorem  for  unbounded  domains.  These  technical  changes  are  straight¬ 
forward,  and  so  are  omitted. 

Remark  4.7.  One  can  show  the  existence  of  global  weak  solutions  for  the 
defocusing  {fiu  >  0)  NLS  equation  (1.2)  with  any  cr  >  0  for  finite-energy  initial 
data  using  similar  arguments,  but  using  the  conservation  laws  of  mass  and  energy 
in  place  of  the  dissipation  relation  (4.7)  to  obtain  the  weak  compactness  of  Step  2. 

5.  Local  Classical  Solutions 

The  solutions  found  in  the  last  section  are  too  weak  to  obtain  proofs  of  their 
uniqueness  or  regularity.  In  this  section  we  establish  the  local  existence,  uniqueness, 
and  regularity  results  for  classical  solutions  of  the  generalized  complex  Ginzburg- 
Landau  equation  (1.1).  Our  objective  here  is  to  provide  the  basic  results  necessary 
to  justify  the  formal  manipulations  necessary  to  establish  the  existence  of  global 
solutions  in  Section  7.  More  general  results  might  be  derived  by  adapting  the 
important  new  techniques  developed  by  Bourgain  [8,  9]  for  the  NLS  equation  over 
T^,  however  our  needs  here  are  less  demanding.  Indeed,  our  proofs  mostly  use 
standard  techniques  [32,  43,  70,  80]  and  some  detail  will  be  omitted. 

Local  existence  and  uniqueness  of  classical  solutions  of  the  generalized  complex 
Ginzburg-Landau  equation  will  be  established  by  using  the  contraction  mapping 
theorem.  In  the  first  part  of  this  section  we  will  describe — first  as  an  abstract 
result,  then  applied  to  the  CGL  equation— the  setting  necessary  to  obtain  a  so- 
called  mild  solution  from  continuous  initial  data,  and  elevate  it  to  a  classical  solution 
by  a  bootstrapping  procedure.  In  the  second  part  of  this  section  the  contraction 
mapping  argument  is  reexamined  in  order  to  extend  the  class  of  initial  data  that 
evolve  into  local  classical  solutions. 

Consider  the  evolution  oiu  =  u{t)  in  a  Banach  space  X  to  be  governed  by  the 
abstract  initial-value  problem 

(5.1)  dfU  =  Lu N {u) ,  u{0)  =  u^^eX. 

Here  the  linear  operator  L  is  assumed  to  be  the  infinitesimal  generator  of  a  strongly 
continuous  semigroup  S{t)  over  X  (so  that  the  linear  problem  with  N  ~  0  is  well- 
posed,  see  [70]).  The  perturbation  N  is  usually  a  nonlinear  map  over  X. 
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The  well-posedness  of  the  above  initial-value  problem  can  be  established  by  a 
contraction  mapping  argument  after  formally  recasting  (5.1)  in  its  so-called  mild 
formulation 


it)  =  S{t)u^^  +  t  S(t  -  t')N{u{t'))  dt' . 
Jo 


In  order  to  employ  the  contraction  mapping  theorem  we  shall  assume  that  the 
perturbation  N  is  locally  Lipschitz  as  a  map  from  X  into  itself.  More  specifically, 
this  means 

(i)  ||Ar(u)||x  <  Cbadlwllx)  for  every  w  G  X, 

(ii)  ||iV(Mi)  —  A^(«2)||x  ^  C'Lip(IIWillx)  ||W2||x)  l|wi  —  M2|Ix 
for  every  Ui,U2  €  X, 

where  (7bd(')  and  Cupi'r)  are  nondecreasing  functions  of  their  arguments.  We 
remark  that  N  and  L  can  always  be  chosen  to  satisfy  N{0)  =  0  in  which  case 
condition  (ii)  implies  (i).  Given  such  a  perturbation  one  can  prove  the  following 
basic  result  [70,  page  184]. 

Theorem  5.1  (Basic  Local  Existence) .  For  every  p  >  0  there  exists  a  time 
T{p)  >  0  such  that  for  every  initial  data  G  X  with  ||u^^||x  <  p  there  exists  a 
unique  u  G  (7([0,T],X)  satisfying  the  mild  formulation  (5.2).  In  addition^  the  map 
from  to  u  is  a  locally  Lipschitz  function  from  X  to  (7([0,T],  X). 

Such  a  u  is  called  a  mild  solution  for  the  initial- value  problem  (5.1);  it  provides 
the  starting  point  for  an  analysis  demonstrating  that  it  is  in  fact  a  classical  solution. 

Remark  5.1.  If  u  G  (7([0,  r],X)  is  a  mild  solution  of  (5.1)  then  a  direct  cal¬ 
culation  shows  that  it  is  also  a  weak  solution  of  (5.1)  in  the  sense  that 

(^|u(t2))x  “  ('^i^(^l))x 


/  (L*V^Kt))xdt4-  [  {'ip\N{u{t)))^dt, 

Jti  */ti 


for  every  0  <  <  ^2  ^  ^  and  ^  G  V{L*).  Here  (  •  |  •  is  the  usual  bilinear 

duality  between  X  and  its  dual  space  X*,  while  L*  is  the  usual  dual  adjoint  of  L 
with  domain  V{L*)  dense  in  X*. 

We  now  apply  this  general  theory  to  the  initial- value  problem  for  the  spatially 
periodic  generalized  complex  Ginzburg-Landau  equation.  In  that  case  we  choose 

(5.4)  Lu  =  (1 -f  iz^)Au  +  i?u ,  N{u)  = —{1 ipflul'^^u . 

The  associated  semigroup  S{t)  acting  on  u  e  X  can  be  written  as  a  convolution, 
S{t)u  ~  Gt  *  u,  with  its  Green  function  Gt  —  Gt{x)  for  t  >  0  given  by 


The  integral  equation  (5.2)  recast  in  terms  of  this  Green  function  takes  the  form 


it)  =  Gt  *  u'"  +  f  Gt^t'  *  N{u{t'))  dt' . 
Jo 
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In  order  to  apply  the  local  existence  theorem  one  only  need  identify  the  space  X 
and  verify  the  conditions  (i)  and  (ii). 

For  t  >  0  the  Green  function  (5.5)  satisfies  the  L^-estimate 

(5.7)  \\Gt\\^.<y]  (  \9t{x  +  n)\dx=  f  \gt{x)\dx  =  + 

„ez«  •'T" 

from  which  it  follows  that  S{t)  is  bounded  over  Lp(T'^)  for  every  1  <  p  <  oo  with 

(5.8)  ||5(f)«||i,  =  \\Gt*u\\j^,  <  ||u||i,  <  +  ||u||^, . 

Moreover,  it  can  be  shown  [32]  that  S{t)  is  a  strongly  continuous  semigroup  over 
(7(T^)  and  over  L^{T^)  for  every  1  <  p  <  oo. 

Remark  5.2.  The  bounded  operators  S{t)  defined  by  S{0)  =  I  and  S{t)u  = 
Gt^u  for  t>  0  form  a  semigroup  over  that  is  strongly  continuous  for  every 

^  >  0  but  only  weak-*  continuous  at  t  =  0.  (A  sequence  {vn}  converges  to  v  in  the 
weak-*  topology  on  if  it  does  so  integrated  against  arbitrary  functions  in 

We  first  appeal  to  the  local  existence  theorem  with  X  =  <7(T'^) .  The  perturba¬ 
tion  N  given  by  (5.4)  is  clearly  locally  Lipschitz  continuous  as  a  map  from  C'(T^) 
into  itself.  A  direct  application  of  the  local  existence  theorem  then  yields  a  unique 
mild  solution  u  —  u{t)  of  the  CGL  equation  over  a  time  interval  [0,  T]  that  depends 
only  on  the  norm  of  This  solution  is  the  limit  of  a  sequence  of 

successive  iterates  of  (5.6),  say  that  defined  by 

Gt-t'  ^N{u^^\t'))dt^ , 

that  converge  in  (7([0,r],  C(T'^))  for  some  T  chosen  sufficiently  small  that  the  se¬ 
quence  contracts. 

Additional  regularity  must  be  demonstrated  in  order  to  elevate  these  mild  so¬ 
lutions  to  classical  solutions,  specifically  they  ought  to  be  in  space  and  in 
time,  so  that  the  derivatives  appearing  in  the  equation  are  classical.  This  is  done 
using  the  following  standard  bootstrapping  argument.  We  use  the  regularity  of  Gt 
for  t  >  0  to  estimate  the  gradient  of  the  successive  iterates  (5.9)  in  the  norm  to 
show  that  the  sequence  converges  in  ^((0,  T],C^{T^)).  This  is  accomplished 

as  follows.  Take  the  gradient  of  the  integral  equation  (5.9), 

(5.10)  V«("+')(l)  =  VGt  *  +  f  VGt-t'  *  iV(u(”)(t'))  dt' , 

JQ 

and  estimate 

(5.11) 

||Vw("+'n0llL~  <  IIVGtlLi  lk‘"llL~  +  r  l|VGt_d|^i  ||Ar(«(")(«'))ILoo  dt'. 

Jo 


(5.9) 


{t)  —  Gt*  + 


/ 
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Then,  use  the  -estimate 


||VG*||^,  <  E  /  |V5*(x  +  n)|dx 

(5.12) 

=  /  \Vgt{x)\dx  =  Cail  +  uY^^^ 

Jr^  yt 

where  (7^  >  0  is  a  constant  depending  only  on  the  dimension  d,  to  find  that 


(5.13) 


l|V«("+')(i)|lioo  <  G,  (1  +  ^  ||«-||^, 


(1  +  t/y/^jJ 


This  shows  that  each  lies  in  C((0,  T],  (7(T^)).  By  invoking  the  Lipschitz  conti¬ 
nuity  of  N,  a  similar  estimate  shows  that  the  sequence  is  Cauchy  in  ^((O,  T],  (7(T‘^)), 
so  that  u  e  C{{0yT],  (7^(1^^)).  If  G  (7^(T^),  it  is  possible  to  rewrite  the  integral 
equation  for  Vu  without  putting  any  derivatives  on  (7f,  namely 

(5.14)  Vu{t)  +  f  Gt-t>  *  {DN{u{t^))Vu{t'))  dt' , 

Jo 

where  DN{u)  represents  the  derivative  of  N{u)  with  respect  to  u,  which,  when 
N{u)  is  given  by  (5.4),  acts  on  an  arbitrary  function  w  by 

(5.15)  DN{u)w  = —{1 a\u\‘^^~'^u^w*) . 


Consequently,  estimates  analogous  to  (5.13)  will  not  exhibit  a  singularity  at  ^  =  0 
and  hence  yield  a  w  G  C([0,  T],  (7^(T^)).  A  repetition  of  the  above  regularity  argu¬ 
ment  starting  from  (5.14)  rather  than  (5.6)  then  shows  that  u  is  in  (7((0,  T],  ^^(T*^)). 
Moreover,  because  the  CGL  equation  relates  the  first  time  derivative  to  the  second 
space  derivative,  the  solution  must  also  be  in  C^((0,  T],  (7(1^^))  and  is  therefore  a 
classical  solution  of  the  CGL  equation  so  long  as  it  is  a  mild  solution.  Summarizing, 
we  have  proved  the  following  result. 


Theorem  5.2  (Local  Classical  Solutions  for  (7®  Initial  Data).  For  every  p  > 
0  there  exists  a  time  T{p)  >  0  such  that  for  initial  data  G  (7(T^)  with  ||u^”||j[,oo  < 
p  there  exists  a  unique 

(5.16)  u  e  G([0, T], G(T‘'))  n  G((0, T], G2(T‘*))  n  Gi((0, T], G(T‘^)) , 

satisfying  the  CGL  initial-value  problem.  In  addition,  the  map  from  to  u  is 
a  locally  Lipschitz  function  from  C(T^)  to  (7([0,  T],  (7(T‘^)).  Moreover,  for  initial 
data  G  one  has  u  G  C{[0,T],C^{T^))  nC^{[Q,T],C{T^)). 

In  this  generality  one  can  not  expect  these  solutions  to  be  in  C((0, T],  (7^(T‘^)) 
because  unbounded  singularities  will  be  introduced  at  zeros  of  u  upon  further  dif¬ 
ferentiation  of  (5.15).  However,  additional  regularity  can  be  gain  in  some  cases. 
For  example,  when  a  is  a  positive  integer  the  nonlinearity  is  a  polynomial  in  u 
and  u*  and  one  can  freely  differentiate  (5.15)  without  introducing  any  singulari¬ 
ties.  Whence,  continuing  with  the  bootstrapping  argument,  it  can  be  shown  that 
for  every  t  in  (0,T]  the  solution  has  at  least  one  more  spatial  derivative  than  it 
had  initially.  But  then  repeating  this  argument  implies  that  the  solution  is  in 
(7((0,T],(7°°(T‘^)).  Moreover,  because  the  equation  relates  temporal  derivatives  to 
spatial  derivatives,  the  solution  must  possess  all  temporal  derivatives  too  and  is 
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therefore  a  smooth  (C^)  solution  of  the  CGL  equation  (1,1)  so  long  as  it  is  a  mild 
solution.  More  precisely,  we  have  proved  the  following  result. 

Theorem  5.3  (Local  Smooth  Solutions).  Let  a  >  0  be  an  integer.  Then  for 
every  p  >  0  there  exists  a  time  T{p)  >  0  such  that  for  every  initial  data  G  C{T^) 
with  ^  P  there  exists  a  unique 

(5.17)  u  e  C'([0,T],C'(T^))  n  C^((0,T]  x  T^) , 

satisfying  the  CGL  initial-value  problem.  Moreover,  given  smooth  initial  data  G 
then  u  G  C^{[0,T]  x  T^^). 

Remark  5.3,  The  above  result  justifies  the  formal  manipulations  we  will  carry 
out  in  Section  8,  thereby  enabling  us  to  focus  our  attention  on  the  establishment 
of  global  uniform  Sobolev  bounds.  Moreover,  in  Section  6  we  will  extend  Theorem 
5.3  to  show  that  when  cr  is  a  positive  integer  these  solutions  are  real  analytic. 

Even  when  a  is  not  a  positive  integer,  we  can  still  advance  the  bootstrapping 
argument  so  long  as  the  differentiation  of  (5.15)  does  not  introduce  unbounded 
singularities  at  the  zeros  of  u.  The  lowest  degree  of  homogeneity  for  the  factors 
u  and  u*  appearing  in  a  term  of  the  (n  +  1)®*  derivative  of  \u\‘^^u  will  be  2a  —  n, 
and  this  can  be  controlled  whenever  a  >  n/2.  In  that  case  the  bootstrapping 
argument  will  gain  an  additional  n  spatial  derivatives,  showing  that  the  solution  is 
in  This  observation  then  leads  to  the  following. 

Theorem  5.4  (Local  Solutions).  Let  a  >  n/2  for  some  positive  integer  n. 
Then  for  every  p  >  0  there  exists  a  time  T{p)  >  0  such  that  for  every  initial  data 
n*"  G  C{T^)  with  II  ^  P  there  exists  a  unique 

(5.18)  u  e  Cdo,  T],  C'(T‘^))  n  C((0,  T],  C"+2(T‘'))  n  ^^((O,  T],  C'"(T'')) , 

satisfying  the  CGL  initial-value  problem.  Moreover,  for  u'"  e  one  has 

u  e  (7([o,t],C'"+2(T‘'))  n  C'H[o,r],C"‘(T‘^)). 

Remark  5.4,  Of  course,  we  can  continue  to  trade  two  spatial  derivatives  for 
one  time  derivative.  If  n  =  2k  is  even  this  leads  to  n  G  (7^'^^([0,  T],  C(T^)),  while 
if  n  =  2A;  +  1  is  odd  one  finds  u  G  C'^“*'^([0,  T],  C^(T^)). 

Remark  5.5.  Theorem  5.4  will  be  applied  in  Section  7  with  n  =  1  in  order  to 
justify  some  of  the  formal  manipulations  there,  enabling  us  to  focus  attention  on 
the  question  of  global  existence. 

Remark  5.6.  Each  of  the  above  results  have  a  version  for  initial  data  in 
^oo('i’d)  gy  Remark  5.2  the  semigroup  S{t)  acting  on  //^^(T^^)  is  generally  not 
strongly  continuous  at  t  =  0  but  rather  is  weak-*  continuous.  The  corresponding 
concession  must  be  made  regarding  u  in  the  conclusions  of  the  L°^  versions  of  the 
above  theorems.  For  example,  in  the  version  of  Theorem  5.2  one  must  replace 
(5.15)  with 

(5.19)  U  e  C([o, T],  w*-i,~(T‘^))  n  ^((0, r], C'2(t‘^))  n  cH(o, T],c{r‘‘)) , 

where  w*-L°°(T‘^)  denotes  L°°(T‘^)  endowed  with  its  weak-*  topology.  Outside  of 
that  consideration,  the  proofs  of  the  L°^  versions  are  virtually  identical  to  those 
given  above  [25]. 
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The  basic  existence  argument  given  above  can  be  refined  to  greatly  enlarge  the 
class  of  initial  data  that  evolve  into  classical  solutions  for  positive  times.  We  recall 
the  mild  formulation  (5.2), 


(5.20) 


it)  =  S{t)u^^  +  f  S{t  -  t')N{u{t'))  dt' . 
Jo 


In  this  formulation  it  is  natural  to  distinguish  three  spaces.  One  aims  at  proving 
that  for  initial  data  in  some  space  X  the  solution  is  in  a  “better”  space  Y  on  some 
interval  (0,  T].  In  general,  the  nonlinearity  maps  Y  into  a  third  space  we  shall 
denote  by  Z. 

For  suitable  choices  of  X,  Y  and  Z  the  semigroup  will  be  strongly  continuous 
on  X  and  also  map  X  and  Z  back  into  the  “better”  space  Y  with  a  singularity  at 
t  =  0,  so  that  near  t  —  0  one  has  nonnegative  constants  a  and  /3  such  that 

(5.21)  ||5'(t)it;||Y  <  ||iy||x  for  every  u;  6  X , 

(5.22)  ||5'(t)t(;||Y  <  ct~^  W'^Wz  for  every  it;  G  Z. 

Again,  we  will  assume  that  N  is  locally  Lipschitz,  now  as  a  map  from  Y  to  Z. 
Without  loss  of  generality  one  can  assume  that  N{0)  =0,  and  for  our  convenience 
we  will  only  consider  the  case  where  the  Lipschitz  condition  can  be  written  as 

(5.23)  ||iV(ui)  -  JV(U2)||2  <  C  (ihlliy"  +  ||«2||y")  ||«1  -«2|Iy 

for  some  tr  >  0.  The  a  above  will  be  identified  with  the  cr  of  the  CGL  nonlinearity 
(5.4).  This  covers  many  applications,  including  the  CGL  equation.  Modifications 
that  embrace  more  general  nonlinearities  are  not  hard  [43,  70]. 

Theorem  5.5  (Extended  Local  Existence).  Given  the  above  estimates  (5.21), 

(5.22)  and  (5.23)  where  the  exponents  a,  (3  and  a  satisfy 

(5.24)  0  <  /?  <  1 , 

(5.25)  0  ^  (2(7  4"  1)q:  <C  1 , 

(5.26)  p  +  2aa  <  1 , 

for  every  p  >  0  there  exists  a  time  T{p)  >  0  such  that  for  every  initial  data  e  X 
with  ||w^”||x  <  p  there  exists  a  unique 

(5.27)  ^  G  C([0,r],X)  n  C((0,T],  Y) 

satisfying  the  mild  formulation  (5.20).  In  addition,  the  map  from  to  u  is  a 
locally  Lipschitz  function  from  X  to  (^([0,  T],  X). 

Proof.  In  order  to  indicate  how  to  set  up  the  contraction  mapping  argument, 
we  first  estimate  the  fixed  point  equation  (5.20)  in  the  Y  norm. 


(5.28) 


[\t-t') 

Jo 


||w(t)||y  <  c  ||w™||x  +  cr  j\t  -  dt' 

^  ''Jo 
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This  suggests  that  we  might  be  able  to  prove  contractivity  in  the  space  E([0,T]), 
defined  as  the  completion  of  C'([0,T],Y)  in  the  norm 

(5.30)  lk||j;=  sup  t“||w;(«)|iY. 

«e[o.T] 


Indeed,  (5.29)  shows  that  provided 


(2a+l)a 


0  as  t 


an  assumption  that  is  directly  seen  to  be  equivalent  to  (5.24-5.26),  there  exists  a 
T  >  0  such  that  the  right  side  of  the  fixed  point  equation  (5.20)  maps  sufficiently 
large  balls  with  respect  to  the  E  norm  into  themselves.  Invoking  the  Lipschitz 
condition  (5.23),  one  can  show  in  a  similar  way  that  this  mapping  contracts  under 
the  same  assumption  (5.31)  for  some  T  >  0.  Then  by  the  contraction  mapping 
theorem,  there  exists  a  unique  solution  u  C  E([0,T])  C  (7((0,T],Y)  to  the  fixed 
point  problem  (5.20). 

Finally  one  can  show  that  tt  €  (7([0,  T],X).  To  this  end  we  only  need  to 
check  the  continuity  of  w  at  t  =  0.  First  subtracting  from  both  sides  of  the 
mild  formulation  (5.20),  a  direct  estimate  in  X,  assuming  a  continuous  embedding 


|w(i)  -'“‘"llx  ^ 


(2or+l)Q;  A/O: 


+  c\\uC+^ 


(2a'4'l)Q: 


The  first  term  on  the  right  converges  to  zero  due  to  the  strong  X  continuity  of 
the  linear  semigroup.  Since  \\u\\e  is  bounded,  the  second  term  converges  to  zero 
provided  that  {2a  +  l)a:  <  1,  a  condition  already  contained  in  (5.31).  □ 

Remark  5.7.  Strictly  speaking,  here  we  have  proved  existence  in  a  smaller 
space,  namely  E([0,T]),  than  that  asserted  in  (5.27). 

Let  us  now  apply  Theorem  5.5  to  the  CGL  equation  where,  as  before,  N  and  L 
are  given  by  (5.4)  and  the  associated  linear  semigroup  by  (5.5).  We  first  consider 
the  case  where  the  initial  data  is  in  i.e.  X  =  Lp{T^).  We  also  take  Z  =  Lp{T^) 
and  let  Y  =  L^(T^)  where  r  is  still  to  be  determined.  It  takes  some  relatively 
straightforward  applications  of  Holder  and  Young  inequalities  to  verify  that  the 
Lipschitz  condition  (5.23)  is  satisfied  whenever 

(5.33)  r>(2cr  +  l)p. 

In  order  to  compute  the  exponent  a  of  (5.21),  the  exponent  of  the  “penalty”  in¬ 
curred  by  using  S{t)  as  a  smoothing  operator  from  to  L’’,  we  estimate 

(5.34)  ||G*  *  <  \\Gt\\^,  IkIL,  <  \\GtC^  ||«;||^, 

where 


(5.35) 


.  1  1  1 

1  H —  =  — I —  . 
r  p  q 


By  using  the  estimate  (5.7)  and  the  estimate 


(5.36) 
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for  positive  absolute  constants  a  and  b,  one  finds  that  for  small  t 

(5.37)  IIG.II^,  =  0(t“) ,  lIGtll^^  =  . 

This  used  with  (5.34)  and  (5.35)  gives 

(5.38)  l|5(*)«^IL.  <  ||u;||^,  , 

i.e.  a  ~  f3  =  f(^“f)-  Then  Theorem  5.5,  first  applied  in  the  marginal  case 
r  =  (2(7  +  l)p,  asserts  the  existence  of  a  time  T  >  0  and  a  unique  local  solution 

(5.39)  u  e  C'([0,r],LP(T^))n(7((0,r],L^(T^)) 

subject  to  the  constraint  (2a  +  l)a  <  1,  i.e.  ad  <  p. 

Since  u{t)  e  L^(T^)  for  all  t  >  0,  one  can  re-apply  the  argument  to  show  that 

(5.40)  w  €  C((0,r],L^-(T^))  where  =  (2a  +  l)^p 

for  every  m  G  N.  Finally,  for  some  >  (a  -\-  ^)d  we  can  take  X  =  Z  =  L^"^(T^) 
and  Y  =  (7(T^)  endowed  with  the  supremum  norm  to  obtain  u  G  C((0, T],  ^(T^^)). 
An  application  of  Theorem  5.2  then  yields  the  following. 

Theorem  5.6  (Local  Classical  Solutions  for  Initial  Data).  If  p  satisfies 

(5.41)  1  <  p  <  00 ,  and  ad  <  p^ 

then  for  every  p  >  0  there  exists  a  time  T{p)  >  0  such  that  for  every  initial  data 
^  ^itfi  <  p  there  exists  a  unique 

(5.42)  w  G  C([0,r],LP(T^))  nC((0,r],C2(T^))  nC^((0,T],C(T‘^)) , 

satisfying  the  CGL  initial-value  problem.  In  addition,  the  map  from  to  u  is  a 
locally  Lipschitz  function  from  L^(T^)  to  <^([0,  T],  L^(T‘^)). 

Remark  5,8.  For  p  >2  the  uniqueness  of  such  classical  solutions  can  be  ex¬ 
tended  to  hold  within  the  class  of  weak  solutions  of  Theorem  4.1,  In  other  words, 
so  long  as  a  weak  solution  of  the  CGL  equation  is  classical  then  it  is  unique.  What 
is  striking  here  though  is  that  in  subcritical  cases  (ad  <  2)  Theorem  5.6  asserts 
the  existence  of  solutions  with  initial  data  for  some  p  <  2 — solutions  which 
therefore  lie  outside  the  class  of  weak  solutions  of  Theorem  4.1.  In  fact,  as  we  will 
show  below,  for  the  subcritical  case  a  =  d  =  1  one  can  use  the  same  technique  to 
establish  the  existence  of  solutions  for  initial  data  in  classes  of  distributions  II^(T) 
for  <q  <0, 

Remark  5.9.  This  result  does  more  than  extend  to  the  class  of  initial  data 
that  evolve  into  classical  solutions  for  positive  times.  More  importantly,  it  estimates 
the  time  interval  over  which  any  classical  solution  will  exist  in  terms  of  its  norm. 
This  means  that  in  order  to  prove  that  any  classical  solution  is  global  in  time  it 
suffices  to  control  its  norm,  where  p  satisfies  (5.41).  In  Section  7  we  present  such 
temporally  global  a  priori  bounds  for  an  interval  of  i/  values  depending  only  on 
a  and  d,  but  not  p.  In  that  case  one  has  global  classical  solutions  (Theorem  7.1). 

Remark  5.10.  The  proof  given  here  differs  from  that  in  [80]  in  that  the  space 
E([0,T])  used  here  for  the  contraction  mapping  argument  is  larger  than  the  space 
used  there,  but  can  be  abstracted  from  [80]  with  minor  modifications.  Essentially 
the  same  methods  lead  to  similar  results  for  the  CGL  equation  over  More 
sophisticated  techniques  lead  to  similar  results  even  in  the  dissipationless  case  of 
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the  NLS  equation,  both  over  [10,  24,  79]  and  over  T  where  Bourgain  [8,  9]  has 
developed  an  L^(T)  theory  for  a  <2. 

Remark  5.11.  An  important  special  case  is  when  p  =  2(7  +  2,  in  which  case 
the  above  theorem  yields  local  classical  solutions  whenever 

2 

(5.43)  d<2  +  -. 

This  condition  coincides  with  condition  (3.7)  for  the  existence  and  uniqueness  of 
local  solutions  for  the  NLS  equation  in  77^(1^^).  In  Section  7  we  present  temporally 
global  a  priori  bounds  for  a  region  of  the  pz/'-plane  depending  on  cr,  but 

not  d.  In  that  case,  whenever  (5.43)  is  satisfied  one  has  global  classical  solutions 
(Theorem  7.2). 

In  the  following,  we  will  use  Theorem  5.5  in  its  full  generality  to  prove  existence 
of  a  local  solution  starting  from  initial  data.  The  Sobolev  spaces  can  be 

defined  for  any  g'  C  E.  To  do  so  we  will  use  w{^)  to  denote  the  Fourier  coefficients 
of  a  function  w  €  L^(T^),  so  that 


(5.44)  w{x)  =  ^  w{^)  ,  w{^)  =  e 

The  norm  of  a  function  w  can  then  be  defined  [2]  as 


{^)=  f  e-^^-^w{x)dx 

JTd 


-  E  + 


^€27rZ^ 


The  existence  result  we  are  going  to  prove  is  relatively  technical  and  will  not 
be  used  in  any  of  the  subsequent  sections,  so  that  it  may  be  skipped  in  a  first 
reading.  The  motivation  for  this  further  inquiry  comes  from  the  observation  that 
in.  the  CGL  equation  scales  dimensionally  like  A  so  that  by  formally  trading 
powers  for  derivatives  in  condition  (5.56),  one  transforms  Theorem  5.6  into  the 
following  conjecture.  Given  E  where  q  >  f  —  there  exists  a  unique 

local  classical  solution.  We  will  show  under  certain  restrictions  that  this  is  indeed 
the  case  [56] . 

We  first  remark  that  it  is  very  problematic  to  conduct  the  argument  entirely 
in  spaces.  The  difficulty  lies  in  obtaining  a  Lipschitz  estimate  (5.23),  which  is 
easily  accomplished  only  when  Y  =  Z  =  iJ^(T^)  is  an  algebra,  i.e.  when  q  >  d/2. 
(See  [2]  for  a  review  of  Sobolev  algebras.)  In  this  case  the  basic  local  existence 
result.  Theorem  5.1,  immediately  assures  the  existence  of  a  local  mild  solution. 
This  approach  however  does  clearly  not  exhaust  the  conjectured  range  of  admissible 
values  for  q. 

Secondly,  we  observe  that  the  only  interesting  case  is  ^  <  0.  Namely,  when 
Qf  >  I  —  L  and  q  >  0,  one  can  always  find  a,  p  >  crd  so  that  the  Sobolev  embedding 
jjq  ^  IP  holds,  i.e.  that  \  Therefore  in  this  case  local  classical  solutions 
are  already  guaranteed  by  Theorem  5.5. 

Assume  therefore  that  E  =  X  for  g  <  0.  Take  the  two  other  spaces 

as  in  the  argument,  i.e.  Y  =  L’'(T^)  and  Z  =  I/P(T^).  While  /^  =  f  (^  “  r) 
the  restriction  r  >  (2cr  +  l)p  are  as  before,  it  still  remains  to  find  the  exponent  a 
of  the  “penalty”  for  using  S{t)  as  a  smoothing  operator  from  to  .  This  can 
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be  done  by  going  via  the  intermediate  space  i.e.  by  using  (5.38)  to  infer  that 
for  r  >  2, 


(5.46) 


\\S{2t)w\\^^<ct  2(2  r-)  ||5(<)w||^3 


The  convolution  S{t)w  =  Gt*w  is  best  estimated  by  deriving  bounds  on  its  Fourier 
transform.  Specifically,  we  have 

\\Gt*w\\l= 

(5.47) 

<  sup  {(i+i^n-’is(op}  E 

^e27rZ‘^  ^e27rZ<i 

where  w{^)  denotes  the  Fourier  coefficients  of  w  and  the  Fourier  coefficients  of  Gt 
are  given  by 


(5.48) 


J(^)  = 


The  supremum  in  (5.47)  is  easily  estimated  by  simple  optimization  with  respect  to 
resulting  in 


2{R+l)t 


kll 


(5.49)  ||G,*«,|li.<(^) 

For  sufficiently  small  t  one  has  altogether 

(5.50)  ||S'(t)li;||^^  <  Ct  2^2  2  ||u/||^q  . 

Again,  we  take  the  marginal  case  of  the  Lipschitz  condition,  r  =  (2cr  +  l)p.  Then 
condition  /?  +  2a a  <  1  of  Theorem  5.5  is  satisfied  if  and  only  if 


w\ 


(5.51) 


d  1 
^^2~a 


We  still  need  to  check  the  other  two  conditions  of  that  theorem.  Condition  (5.25) 
imposes  an  upper  bound  on  r,  namely 

(5  52)  ,  ;  2{2a  +  l)d 

^  ^  {2a  +  l){d-2q)-4' 

Similarly,  (5.24)  imposes  a  lower  bound  on  r, 

(5.53)  r  >  ad. 

Note  first  that  given  (5.51),  the  upper  and  lower  bounds  on  r  can  be  satisfied 
simultaneously.  For  technical  reasons,  we  also  have  two  additional  lower  bounds  on 
r.  In  the  derivation  of  (5.50)  it  was  necessary  to  assume  that  r  >  2,  and  we  must 
also  require  p>l,  i.e.  r>2(j  +  l.  These  combined  with  (5.52)  translate  into 


(5.54) 

2 

^  ^  2<r  +  1 

and 

(5.55) 

d  d  2 
^^2  2ct  + 1 

respectively.  Then  finally  Theorem  5.5  yields  a  unique  mild  solution  u  G  C((0,  T],  Y) 
for  some  T  >  0.  Moreover,  since  u{t)  G  L^(T^)  for  all  t  G  (0,  T],  further  regularity 
can  be  achieved  by  a  direct  application  of  Theorem  5.6.  One  thus  has  the  following. 
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Theorem  5.7  (Local  Classical  Solutions  for  Initial  Data).  Provided  q  sat¬ 
isfies  (5.54),  (5.55)  and 

d  1 

(5.56)  g  >  2  -  - 

then  for  every  p  >  0  there  exists  a  time  T{p)  >  0  such  that  for  every  initial  data 
with  <  p  there  exists  a  unique 

(5.57)  u  e  C([0,T],i7«(T‘'))  n(7((0,T],C2(T‘^))  nCi((0,r],C(T‘')), 

satisfying  the  CGL  initial-value  problem.  In  addition,  the  map  from  to  u  is  a 
locally  Lipschitz  function  from  to  C{[0,T], 

Remark  5.12.  We  believe  conditions  (5.54)  and  (5.55)  are  technical  and  can 
be  removed  by  more  sophisticated  techniques.  Moreover,  by  following  [10],  the  use 
of  Besov  spaces  should  allow  one  to  obtain  the  marginal  case  when  (5.56)  is  satisfied 
as  an  equality.  Note  also  that  provided  a  >  5  and  ad  >  1 — thus  in  particular  when 
a  is  an  integer — condition  (5.56)  dominates  and  we  have  proved  local  existence  for 
the  conjectured  class  of  initial  data. 

Remark  5.13.  The  results  in  this  section  extend  to  with  only  minor  mod¬ 
ifications  in  their  proofs. 


6.  Analyticity  of  Solutions 

Let  us  now  discuss  Gevrey  class  regularity  which  is  a  stronger  notion  than  the 
Qoo  regularity  we  have  encountered  before.  It  not  only  asserts  that  all  derivatives 
of  the  solution  u  are  bounded,  but  also  that  these  bounds  depend  on  the  order  of 
the  derivatives  in  some  prescribed  way.  Gevrey  [34]  used  this  notion  as  a  setting 
in  which  to  extend  Cauchy-Kowalevski  existence  arguments  to  classes  of  functions 
that  are  not  necessarily  analytic  (for  a  review  of  the  analytic  case  see  e.g.  [45]).  In 
fact,  they  are  special  cases  of  the  quasianalytic  classes  which  had  been  introduced 
earlier  by  Hadamard  [41].  La  Vallee  Poussin  [52]  showed  that,  among  the  quasiana¬ 
lytic  functions,  the  Gevrey  classes  are  characterized  by  an  exponential  decay  of  their 
Fourier  coefficients  (see  also  [49]).  In  turn,  this  characterization  has  recently  proved 
useful  for  showing  that  the  solutions  of  various  nonlinear  partial  differential  equa¬ 
tions  are  analytic.  Foias  and  Temam  [30]  developed  this  technique  in  the  context 
of  the  two-dimensional  incompressible  Navier-Stokes  equations,  thereby  simplifying 
earlier  work  of  Kahane  [47]  that  had  been  more  in  the  style  of  Gevrey.  Doelman 
and  Titi  [22]  subsequently  applied  the  technique  to  the  cubic  CGL  equation,  and 
Ferrari  and  Titi  [27]  extended  the  results  to  a  large  class  of  analytic  nonlinear 
parabolic  equations.  In  particular,  the  authors  of  the  last  reference  have  exploited 
the  fact  that  certain  subclasses  are  normed  algebras,  thus  making  the  estimation 
of  polynomial  nonlinearities  a  triviality.  This  is  the  strategy  we  will  adopt  because 
of  its  brevity  and  elegance. 

A  function  w  E  (7‘^(T^)  is  said  to  be  of  Gevrey  class  s  for  some  s  >  0  if  there 
exist  constants  p  >  0  and  M  <  oo  such  that  for  every  x  e  and  every  a  e 
one  has 
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Here  we  employ  the  usual  multi-index  notation  in  which 

2cr+l  2<t+1  d 

(6.2)  |a|  =  =  H  -  IT  ' 

j  =  l  j  =  l  j  =  l 

The  set  of  all  functions  of  Gevrey  class  5  is  a  vector  space,  denoted  G^(T^).  It  is 
closed  under  multiplication  and  differentiation.  Moreover,  the  composition  of  two 
functions  of  Gevrey  class  s  is  again  of  class  s. 

It  is  classical  that  is  the  space  of  real  analytic  functions  (7^(T^);  a 

proof  can  be  found,  for  example,  in  [45,  page  65].  For  0  <  s  <  1  the  class  (7^(T^) 
is  subclass  of  the  analytic  functions,  while  for  1  <  s  <  oo  it  contains  the  analytic 
functions.  In  fact,  one  has  a  hierarchy  of  spaces  such  that  O<S1<S2<00  implies 
the  proper  containments 

(6.3)  (T^)  C  (T^)  C  (T^) . 

Moreover,  the  union  of  the  does  not  exhaust  because  there  are 

quasianalytic  functions  that  are  not  members  of  a  Gevrey  class  [52]. 

In  what  follows  it  will  be  more  convenient  to  characterize  Gevrey  classes  in 
terms  of  the  fractional  Sobolev  spaces  (as  defined  on  page  28  for  every 

r  €  M)  with  r  >  0,  rather  than  uniform  bounds  such  as  (6.1). 

Lemma  6.1.  Given  s  >  0  and  r  >  0,  Then  w  €  G®(T^)  if  and  only  if  there  are 
constants  p,  M  €  (0,  oo)  that  may  depend  on  r,  s  and  w  such  that  for  every  n  G  N 
one  has 

(6.4)  iiv"hIh^=  (  E  (i+ifiTi^i^"R£)n  <M 

\Ce27rZ‘^  / 

The  proof  is  achieved  by  a  direct  application  of  the  Sobolev  embedding  theorem 
[2]  and  shall  be  left  to  the  reader. 

This  lemma  enables  us  to  characterize  functions  in  in  terms  of  the  decay 
of  their  Fourier  coefficients,  a  result  of  the  type  first  due  to  La  Vallee  Poussin 
[52,  59],  although  this  attribution  has  been  obscured  in  the  recent  literature.  The 
construction  here  uses  the  operator  A  =  \/~A  that,  like  —A  itself,  is  nonnegative 
and  self-adjoint  so  that  arbitrary  powers  can  be  defined  by  spectral  theory.  For 
each  s  6  (0,  oo)  we  define  a  family,  parameterized  by  r,  of  normed  spaces 

(6.5)  :  ii'’'(T‘^))  =  <  ooj  . 

The  functions  in  any  such  space  have  Fourier  coefficients  that  decay  faster  than 
exp(— r|^|^/®).  The  next  theorem  recovers  the  Gevrey  class  G®(T^)  as  the  union  of 
all  such  classes.  We  believe  the  proof  is  new. 

Theorem  6.2.  For  any  s  >  0  and  r>0, 

(6.6)  G"(T‘')  =  U  ;  i/’'(T'*)) . 

T>0 

Remark  6.1.  The  use  of  the  more  general  rather  than  simply  as  the 
base  space  does  not  complicate  the  structure  of  the  proof,  but  it  is  advantageous 
for  the  arguments  that  will  follow. 
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Proof.  Let  w  e  for  some  r  >  0  and  let  p  =  r/s.  Then 


Ed +  1^1^ 


,n  \c\n/i 


tAV«  ||2 

e  -u/L, 


By  setting  M  =  \\e'^^^^^w\\H^  we  obtain  (6.4),  whereby  w  G 

On  the  other  hand,  let  w  G  G®(T^).  For  an  arbitrary  r  >  0  one  has 


tAV®  2  _ 


m=0  '  ^ 

Now  let  p  and  M  be  such  that  (6.4)  is  satisfied.  By  interpolating  (6.4)  between 
n  ==  0  and  any  integer  n  such  that  m/s  <  2n,  the  inner  sum  appearing  in  (6.8)  can 
be  bounded  as 

m 

(6.9)  + 

e  ^ 

This  bound  is  best  if  we  choose  n  =  rim  =  W^/^^)]  +  1?  where  [•]  denotes  the 
“greatest  integer  less  than”  function.  Upon  making  this  choice  and  applying  the 
result  in  (6.8),  one  arrives  at  the  bound 

m  m 

(6.10)  ll.-'-Hli.  <  f  <2^  -  M"  E  (f  )"^  ■ 

m=0  ^  m=0  '' 

By  making  use  of  the  Stirling  formula  [1]  in  the  form 


(6.11) 


e  i 

lim  —  (n!)  n  ==  1 
n^oo  n 


the  limit  of  the  root  of  the  term  in  the  last  series  of  (6.10)  can  be  evaluated 


lim  — 

m— ^oo  p 


2r  (n^!)«m  2r 


_  2t  1  _  r 
m  p  2s  sp 


Hence,  by  the  Hadamard  root  test,  the  series  in  (6.10)  converges  for  every  r  <  ps, 
whence  w  €  □ 

Remark  6.2.  The  above  proof  gives  a  sharp  relationship  between  the  r  of 
(6.5)  and  the  p  of  (6.4).  Had  we  been  less  careful  in  our  choice  of  the  rim  used  in 
(6.10)  then  this  relationship  would  have  been  missed. 

Remark  6.3.  The  theorem  also  holds  over  M^.  The  proof  can  proceed  in  the 
same  way,  using  Fourier  integrals  in  place  of  Fourier  sums. 
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One  reason  Gevrey  classes  are  useful  in  the  context  of  nonlinear  partial  differ¬ 
ential  equations  is  that  each  G®  is  closed  under  multiplication.  It  would  be  particu¬ 
larly  useful  if  this  property  extended  to  each  of  the  approximating  (normed!)  spaces 
The  following  theorem  states  that  under  certain  conditions  this 

is  indeed  the  case. 

Theorem  6.3.  >  1;  r  >  0,  and  r  >  d/2  then  is  a  Ba¬ 

nach  algebra.  This  means  that  it  is  closed  under  multiplication  and  that  there  exists 
a  finite  constant  G(r,  d)  such  that  any  two  functions  v  and  w  in  ^ 

satisfy  the  inequality 

(6.13)  <  C{r,d)  . 


The  proof  is  a  direct  extension  of  the  usual  proof  that  is  a  Banach 

algebra  when  r  >  d/2  [2],  a  result  that  is  recovered  above  by  setting  r  =  0.  A 
proof  of  this  theorem  for  the  case  when  s  =  1  is  given  in  [27];  this  proof  is  easily 
generalized  for  any  s  >  1. 

The  spaces  :  i?^(T^))  are  naturally  suited  for  the  application  to  par¬ 
abolic  equations:  r  will  be  identified  with  time  so  that  evolves 

from  being  identical  to  at  r  =  0  to  being  a  subset  of  Gevrey  class  s  in  an 
arbitrarily  short  time.  So  if  one  can  show  that  for  some  T  >  0  the  solutions  to  the 
equation  with  initial  data  are  in  Vie^^  ^  :  i3r^(T‘^))  for  every  t  €  (0,  T]  then 
one  has  proved  Gevrey  regularity  of  class  s  over  that  interval. 

Let  us  now  proceed  to  show  that  the  solution  u  of  the  CGL  equation  is  of 
Gevrey  class  1 — in  other  words,  is  real  analytic.  To  do  this  we  employ  the  time 
dependent  norm  As  the  norm  of  u  is  already  controlled  by 

(7.2),  it  is  sufficient  to  derive  a  bound  on  the  seminorm  ||A’"u||f .  A  direct  calculation 
gives 


(6.14) 


=  Re(y  dx 

+  Re  (y  A^e*-^dtu  A^e*^u*  dx) 

-  Re((l  +  iti)  j  A’-e*'^(|«p‘^u)  A^e*^u*  dx)  . 

The  first  term  on  the  right  is  easily  estimated  by  using 

(6.15)  =  {v,Av)j.,  <  ||v||^.  ||At;||^,  <  +  \\Av\\l, , 


with  V  —  A^e^^u.  By  using  the  Cauchy-Schwarz  inequality  on  the  integral  in  the 
last  term  of  (6.14),  one  finds  that  it  is  bounded  by 


(6.16)  ||A’-(«"+i(M‘)'')||t  IIAVIIt  <c\\A^nW}^+\ 


It  is  here  that  we  use  the  fact  that  I>(e*^ :  ff’”(T‘^))  is  an  algebra.  Then  finally 

(6.17)  l±\\A^u{t)\\!  <  (R+  I)  \\A^u{t)\\^  +  c\\A^umr^^- 

This  differential  inequality  is  easily  integrated  and  shows  that  ||A’"u(t)||t  remains 
finite  on  some  interval  [0,  T] .  Given  that  a  is  an  integer,  the  local  existence  theorems 


174  C.  DAVID  LEVERMORE  AND  MARCEL  OLIVER 

of  the  last  section  guarantee  that  u{t)  6  if’'(T^)  for  any  r  and  t  >  0  as  long  as  it 
exists.  Hence  by  initializing  (6.17)  with  ||u(^)||i^^  =  for  some  r  >  d/2 

and  every  such  t  one  can  conclude  the  following. 

Theorem  6.4.  Provided  that  a  is  a  positive  integer,  a  local  classical  solution 
is  real  analytic  in  x  at  any  positive  t  as  long  as  it  exists. 

Remark  6.4.  In  order  to  achieve  global  bounds  with  the  method  presented 
here,  it  is  necessary  to  have  a  priori  control  for  some  r  >  d/2.  The  method  used 
originally  by  Doelman  and  Titi  [22]  requires  only  a  priori  control  independent 
of  d.  However,  that  method  requires  a  separate  and  fairly  involved  estimate  on 
the  nonlinear  term  because  is  not  an  algebra  for  d  >  1.  See 

Theorem  8.4  for  the  statement  of  the  global  result. 

7.  Global  Classical  Solutions 

The  task  of  this  section  is  to  show  that  there  are  regions  of  the  /ii/-plane  where 
we  can  elevate  the  local  classical  solutions  found  in  Section  5  to  global  classical 
solutions.  In  these  regions  the  CGL  equation  defines  a  dynamical  system  and  we 
can  investigate  the  properties  of  its  global  attractor.  The  values  of  /i  and  i/  for 
which  we  establish  this  regularity  will  depend  on  the  spatial  dimension  d  and  the 
degree  of  nonlinearity  a.  The  results  of  the  last  sections  were  very  general — for 
example,  we  have  the  same  local  existence  and  regularity  results  for  either  sign  of 
the  real  part  of  the  coefficient  of  the  nonlinear  term— and  assert  substantially  little 
more  than  the  fact  that  the  problem  is  well-posed.  The  estimates  and  results  in  this 
section,  however,  will  depend  much  more  delicately  on  the  structure  and  dynamics 
of  the  CGL  equation.  They  rely  not  only  on  the  signs  of  the  dissipative  terms,  but 
on  the  relative  signs  of  the  coefficients  of  the  dispersive  terms  as  well. 

As  was  pointed  out  in  the  final  remarks  of  Section  5,  it  suffices  to  obtain 
global  control  of  any  norm  where  p  >  ad.  Such  bounds  can  be  obtained  by 
computing  the  time  evolution  of  the  norm,  say,  and  then  perform  estimates  on 
the  various  terms  in  order  to  obtain  closed  differential  inequalities.  This  shall  be 
first  demonstrated  for  the  L’^  norm  which  is  simplest  to  control,  A  direct  calculation 
and  subsequent  application  of  the  Holder  inequality  yields 

^|2a+2  _  J  I  Vwp  dx 

)cr+l 

. 

In  the  appendix  it  is  shown  that  for  a  >  0  this  differential  inequality  implies  the 
norm  of  u  satisfies 

X 

(7-2) 

This  estimate  is  independent  of  the  initial  data  and  decays  to  as  t  ^ 

00.  Hence  for  times  bounded  strictly  away  from  0,  the  norm  of  u  is  bounded 
uniformly  in  both  time  and  initial  condition.  Whenever  ad  <2,  Theorem  5.6  then 
implies  that  the  CGL  equation  has  global  classical  solutions  for  all  initial  data  in 
For  ad  >  2,  one  must  control  more  than  the  norm. 


J  dx  =  R  J  dx  ~  j 


<R  \ur  dx 
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Directly  computing  the  evolution  of  the  norm  gives 


lul^dx 


\u\^  ^u'^Audx 


u*Audx 


The  net  contribution  of  the  last  two  terms  on  the  right  side  will  be  nonpositive  for 
p>2  provided 

p.4)  M<?^. 

This  fact  can  be  established  many  ways — either  using  Sobolev  bounds  as  in  [3], 
quadratic  forms  as  in  [25],  or,  as  will  be  done  here,  as  a  consequence  of  Lemma  7.3. 
Upon  supposing  (7.4)  is  satisfied,  the  last  two  terms  on  the  right  of  (7.3)  can  be 
neglected  while  the  remaining  terms  can  be  treated  through  an  application  of  a 
Holder  inequality  analogous  to  the  one  for  the  LF'  estimate  (7.1)  thereby  yielding 
the  inequality 


r 

pdt  J 


\Pdx<R  \u\Pdx 


\u\^  dx 


The  appendix  again  gives  a  uniform  bound,  identical  to  (7.2), 


The  existence  of  global  classical  solutions  is  then  guaranteed  by  Theorem  5.6  in 
the  subcritical  case  (ad  <  2)— as  mentioned  before — by  taking  p  =  2.  In  the  critical 
case  {ad  =  2)  one  can  still  satisfy  ad  <  p  and  (7.4)  simultaneously  by  choosing  p 
sufficiently  close  to  2.  In  the  supercritical  case  [ad  >  2)  however,  the  best  we  can 
do  is  to  choose  p  sufficiently  close  to  ad,  making  (7.4)  a  restriction  on  the  possible 
values  of  v  for  which  the  existence  of  global  classical  solutions  can  be  established. 
This  is  summarized  in  the  following. 

Theorem  7.1.  For  a  >  0,  the  generalized  CGL  equation  with  (7^  initial  data 
has  unique  global  classical  solutions  provided  either  that  ad  <2  or  that  ad>  2  and 
u  satisfies 

(7.7)  1^1  <2v^^ 

aa  —  Z 

Thus,  in  the  supercritical  case  the  CGL  equation  has  global  classical  solutions 
only  when  i/  lies  in  the  strip  around  the  //-axis  of  the  ////-plane  given  by  (7.7)  while 
fi  may  take  on  any  parameter  value. 

Remark  7.1.  When  global  classical  solutions  exist,  estimate  (7.6)  shows  that 
the  attractor  is  confined  to  the  closed  ball  of  radius  in  provided  p  satisfies 

(7.4).  Furthermore,  estimate  (7.6)  is  independent  of  the  initial  condition.  This 
means  that  in  an  arbitrarily  short  time  all  solutions  enter  some  ball  of  finite  radius. 


Remark  7.2.  The  basic  differential  inequalities  (7.1)  on  the  norm  of  u  and 
(7.5)  on  the  norm  are  sharp  for  the  spatially  homogeneous  solutions  (2.10)  of 
the  CGL  equation.  Moreover,  their  proofs  use  essentially  the  boundedness  of  the 
domain  and  do  not  extend  to  R^.  However,  a  bound  on  the  norm  of  u  over 
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that  is  exponentially  growing  in  time  can  be  obtained  provided  (7.4)  is  satisfied 
by  neglecting  the  last  three  terms  on  the  right  of  (7.3).  Such  a  bound  is  enough  to 
infer  that  Theorem  7.1,  as  stated,  holds  over  R"^. 

An  alternative  to  directly  controlling  the  norm  is  to  rather  first  directly 
control  the  norm  and  then  control  through  Sobolev  estimates.  This  is 
however  subject  to  the  constraint 

{oo  for  d  =  1  or  2 , 

^  f«,d>3, 

d-2 

Given  that  we  are  also  constrained  by  ad  <  p,  this  restricts  us  to  those  cases  where 


(7.9)  d<2  +  -, 

which  is  incidentally  the  condition  for  local  well-posedness  of  the  underlying  NLS 
problem,  equation  (3.7). 

An  estimate  on  the  norm  can  be  derived  in  a  very  similar  way.  As  the 
norm  is  already  controlled  by  (7.2),  it  suffices  to  control  the  norm  of  Vw.  To  do 
so  we  assume  a  >  ^  so  that  Theorem  5.4  gives  enough  regularity  of  the  solution  to 
take  the  gradient  of  the  CGL  equation.  A  direct  calculation  then  shows 


=  J  J  \Au\‘^  dx 


u\^^u*Audx]  +/ilm(  /  \u\‘^^u* Audx 


(7.10) 


The  last  two  terms  on  the  right  have  the  same  form  as  those  on  the  right  side  of 
(7.3)  upon  identifying  p  with  2cr  +  2  and  i/  with  —p>.  Upon  treating  them  in  the 
same  way,  their  net  contribution  will  be  nonpositive  provided  (compare  with  (7.4)) 


(7.11) 


'2cr+l 


In  that  case,  these  terms  can  be  neglected,  and  an  elementary  interpolation  in¬ 
equality  applied  to  the  second  term  (i.e.  simply  integrate  by  parts  and  use  the 
Cauchy-Schwarz  inequality)  leads  to 


\Vu\^  dx  <  R  /  jVupdx 


J \Vu\^dx 

J  \u\'^dx 


Upon  using  the  bound  for  u  provided  by  (7.2),  this  differential  inequality  gives 
uniform  upper  bounds  on  the  norm  of  Vu  by  following  the  procedure  in  the  ap¬ 
pendix.  Indeed,  even  in  this  case  the  differential  inequality  (7.12)  can  be  integrated 
exactly  to  obtain 


(7.13)  ll^'^WIIx/2  ^  ^{a,Rt)  _  ^-2aRt  j  ’ 

where  $  is  given  in  terms  of  the  hypergeometric  function  F  (e.g.  see  [1])  by 
- - -  r.^\  2(1 +  <r) 


$(cr,  Rt)  = 


F(l,  1, 2  +  1  - 
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The  function  $(<7,  Rt)  is  monotonically  decreasing  in  t  and  one  can  easily  find  that 
$(<j,  0)  =  2(1  +  a)  and  ^(a,  oo)  =  2.  Thus  (7.13)  provides  a  uniform  bound  on 
||Vw||x,2  analogous  to  the  estimate,  equation  (7.6).  Then  Theorem  5.6  shows  the 
existence  of  global  classical  solutions  in  a  strip  around  the  zz-axis  of  the  parameter 
plane  (7.11)  similar  to  the  strip  around  the  jU-axis  (7.7)  from  Theorem  7.1.  In  the 
case  when  p  =  2  +  2(7,  these  strips  are  completely  symmetric,  and  both  subject  to 
the  restriction  d  <  2  2/a. 

However,  for  no  p  >  2  does  the  union  of  these  strips  extend  to  all  of  the 
modulationally  stable  region  of  the  underlying  NLS  equation  where  the  energy 
functional  is  positive  definite  and  hence  no  finite-time  singularities  can  occur.  As 
the  CGL  equation  is  a  dissipative  perturbation  of  the  NLS  equation  (disregarding 
for  the  moment  the  linear  growth  proportional  to  R),  we  expect  to  have  regularity 
at  least  in  this  region.  And  indeed,  a  “perturbation”  of  the  NLS  energy  functional, 

(7.15)  F=  f  \Vu\^  +  ^\u\'^-+^dx, 

J  cr  H-  1 

where  /3  >  0  is  to  be  chosen  later,  will  help  to  considerably  improve  the  region 
for  which  global  existence  of  classical  solutions  can  be  shown.  The  technical  reason 
why  this  particular  combination  yields  better  results  than  separate  estimates  on  the 

and  norms  is  that  the  problematic  terms  in  the  and  estimates 

have  the  same  functional  dependence  on  the  solution  u  and  therefore  taking  the 
linear  combination  (7.15)  and  optimization  in  provides  for  partial  cancelation  of 
these  terms.  This  is  reflected  in  the  following  result,  the  present  form  of  which 
is  due  to  Ginibre  [37]  and  slightly  improves  upon  the  region  of  validity  given  in 
[3,  25], 

Theorem  7.2.  For  a  and  d  <2-\-2/a,  the  generalized  CGL  equation  with 

initial  data  has  unique  global  classical  solutions  if  the  dispersive  parameters  lie 
in  the  region  of  the  pu -plane,  bounded  by  hyperbolae, 

l-\-  pu  ^  yj2a  -h  1 
\p-iy\~  a 

This  region  of  the  pu-plane  completely  contains  the  first  and  third  quadrants 
where  the  underlying  NLS  equation  is  modulationally  stable,  the  strips  found  from 
the  previous  separate  and  bounds  (this  by  the  very  form  of  F).  In 

addition,  it  contains  the  whole  CGL  modulationally  stable  region  1  pu  >  0  and 
a  bit  more  depending  on  the  degree  of  nonlinearity.  Indeed,  for  d  =  1, 2  it  reduces 
to  the  modulationally  stable  region  in  the  limit  of  a  high  degree  of  nonlinearity 
((7  ^  oo).  However,  part  of  the  strip  (7.7)  obtained  from  a  direct  bound  lies 
outside  this  region  when  p  >  2  T  2a.  This  is  summarized  in  Table  7.1  for  integer 
values  of  a. 

Figure  7.1  illustrates  the  supercritical  case  d  =  S,  a  ~  I;  the  corresponding 
picture  for  any  case  where  2  <  ad  <  2  +  2a  is  similar.  The  inner  pair  of  hyperbolae 
bounds  the  modulationally  stable  region  1  piy  >  0  while  the  outer  pair  limits 
the  region  where  global  classical  solutions  can  be  obtained  from  Theorem  7.2.  The 
little  unshaded  strip  outside  the  hyperbolic  boundaries  indicates  the  region  where 
global  classical  solutions  that  can  still  be  obtained  by  using  the  direct  result, 
Theorem  7.1. 

Remark  7.3.  So  far  we  have  allowed  for  non-integer  values  of  a  which  is  suf¬ 
ficient  to  obtain  global  classical  solutions  that  are  in  the  spatial  variables.  For 


(7.16) 
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cr  =  1 

^7  =  2 

cr  >  3 

1 

no  restriction 

no  restriction 

2v'(t  -  1 

M<  ^_2 

1  -{-  /XI/  ^  \/2a  -h  1 

-  (7 

d  =  2 

no  restriction 

\i/\  <  V3  or 

1  -h  /XI/  ^  \/5 
\fi-v\-  2 

yj2G  -  1 

H<  or 

1  4-  /XI/  ^  \/2cr  1 

\lJ.-v\-  a 

d  =  3 

\i/\  <  y/S  or 

M<-^ 

,  ,  ^  2v'3^^ 

3<r-2 

d>4 

1  1^  Vd-1 

d-2 

,  ,  ,  v^2d-l 

,  ,  2\/ad  —  l 

.d-2 

Table  7.1.  Sufficient  restrictions  for  the  existence  of  global  clas¬ 
sical  solutions  to  the  complex  Ginzburg-Landau  equation.  The 
bounds  on  \u\  and  the  “no  restriction”  entries  are  a  result  of  The¬ 
orem  7.1  while  the  hyperbolic  boundaries  arise  from  Theorem  7.2. 
The  case  where  d  =  3  and  cr  =  1  is  illustrated  in  Figure  7.1. 


a  >nf  2  one  can  even  get  (7"^+^  solutions  from  Theorem  5.4,  but  for  global  smooth 
(i.e.  solutions  cr  needs  to  be  integer.  This  case  will  the  the  focus  of  Section  8 
where  we  derive  explicit  global  bounds  for  all  spatial  derivatives. 

In  order  to  prove  the  results  of  this  section,  it  is  useful  to  have  the  following 
bounds  on  nonlinear  functionals  that  occur  throughout  the  proofs.  They  were  first 
used  by  Ginibre  [37]  to  obtain  the  improved  hyperbolic  boundaries  (7 .16) . 

Lemma  7.3.  For  p  >  2  and  a  function  u  G  i7^(T^)  n  ^(T^)  the  following 
inequalities  hold: 


(7.17) 

(7.18) 


1  \u\^  '^u*Audx^ 

^  27^-1 

[  \u\^  ‘^u^Audx 

P 

J 

[  \u\^~'^u*Audx] 

<  P-2 

f  \u\P-~^u*Audx 

/  '  '  J 

P 

J 
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Figure  7.1.  Parameter  plane  for  the  supercritical  complex  Ginz- 
burg-Landau  equation  (shown  here  is  the  case  d  =  3,  cr  =  1). 
Global  existence  of  smooth  solutions  is  established  in  the  unshaded 
region  only.  The  inner  hyperbolae  are  the  boundaries  of  the  mod- 
ulationally  stable  region  1  fiu  >  0.  The  region  within  the  outer 
pair  of  hyperbolae  is  a  result  of  Theorem  7.1  while  the  small  un¬ 
shaded  strip  outside  this  region  is  a  result  of  Theorem  7.2.  In  the 
sub  critical  and  critical  case  {ad  <  2)  this  strip  extends  to  the  whole 
parameter  plane. 


Proof.  Examine  the  identity,  obtained  through  integration  by  parts, 


(7.19) 


2 


J  \u\^  '^u*Audx 


—p J  |w|^ 

-{p-2)J  M^-V^ 


Vu  •  Vudx . 


For  any  nonconstant  u  the  first  term  on  the  right  side  is  negative  while  the  second 
term  is  smaller  in  magnitude  by  a  factor  no  greater  than  {p  —  2)/p,  whereby  it 
follows  that 


(7.20) 


J  \u\^  '^u*Audx  =  — 


^  ^u*Audx 


exp{i6) , 


for  some  0  in  |^|  <  7r/2  such  that  |sin^|  <  (p  —  2)/p,  or  equivalently,  such  that 
cos^  >  2v^r^^/p.  By  (7.20),  these  trigonometric  bounds  on  6  translate  into  the 
bounds  (7.17)  and  (7.18).  □ 


This  result  is  more  than  enough  to  provide  the  missing  links  in  the  deriva¬ 
tions  of  the  differential  inequalities  (7.5)  and  (7.12)  and  it  also  enables  us  to  prove 
Theorem  7.2. 
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Proof  of  Theorem  7.2,  We  compute  the  time  evolution  of  F  by  taking  the 
appropriate  linear  combination  of  (7.3)  with  p  =  2cr  +  2  and  (7.10): 


2  dt 


j \Vu\^ +0^\u\^‘^+'^dx-  y|A«p  +  ^2|y|4^+2^^ 
(l  +  /3^)  ^®(y  |wp‘^u*Audx)  +  {ij,  —  0^u) 


\u\'^^u*Audx 


The  benefits  of  this  linear  combination  are  immediately  obvious,  noting  that  when 
fi  and  1/  have  the  same  sign,  one  can  always  find  a  (3  that  makes  the  coefficient  of 
the  last  term  zero.  In  order  to  obtain  an  optimal  result,  we  bound  the  last  two 
terms  by  Lemma  7.3.  In  addition  to  that,  one  can  use  a  fraction  k  of  the  negative 
definite  term  to  help  control  the  last  two  terms.  This  is  best  done  through  the 
estimate 

(7.22)  -  j \Au\'^  +  0^\u\^'^+^dx<-2p\^j\u\'^^u*Audx  . 

Altogether,  one  obtains  the  differential  inequality 


(7.23) 


2dt 


J  \Vuf -\- dx  ~  {1  -  k)  j \Au\^ 0^\u\ 

(2kP  +  (1  +  -\l^-  /  I 


2\  \/2cr  +  1 


Audxl  , 


The  last  term  can  be  neglected  provided  it  is  possible  to  choose  P  such  that  its 
coefficient  is  nonnegative.  This  is  always  the  case  if  >  0.  On  the  other  hand, 
when  <  0  we  can  without  loss  of  generality  consider  the  case  where  >  0  and 
IV  <  0.  Taking  k  =  1^  this  results  in  the  condition 

(7.24)  2/3+(1+/32)^/^I-(;.-^2,.)^>0. 

Optimization  in  p  then  results  in  the  restriction  to  the  hyperbolic  parameter  region 
(7,16).  As  we  have  used  up  all  of  the  negative  definite  term,  one  obtains  a  bound 
on  F  that  is  exponentially  growing  in  time,  sufficient  for  Theorem  5.6  to  assert  the 
existence  of  global  classical  solutions.  □ 

Remark  7.4.  For  those  fj,  and  p  for  which  (7.16)  holds  as  a  strict  inequality, 
we  can  keep  k  <  1  and  obtain  uniform  bounds  on  the  and  norms  by 

closing  the  differential  inequality  (7.23)  as  follows: 


(7.25) 


<  1  + 


((T  +  1)2 

/32 

(<r  +  l)2 


(y  |Vupdx)  +  (/?^  + 

y  iup  dx  y  I  Aup + p 


((T+  1)  = 


2.  |4.+2 


In  the  second  step  we  have  used  integration  by  parts  and  Cauchy- Schwarz  on  the 
first  and  the  Cauchy-Schwarz  inequality  on  the  second  term.  Hence,  provided  p  can 
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be  chosen  so  that  (7.24)  holds  as  a  strict  inequality,  F  will  satisfy  the  differential 
inequality 


(7.26) 


(<j  + 1)^ 


Upon  again  using  the  bound  provided  by  (7.2),  this  differential  inequality  may 
be  integrated  as  in  the  appendix  to  obtain  the  uniform  bound 


(7.27) 


((7  +  1)"+/?^  f  R  \  - 

(1  — k)((j  +  1)  — 


This  bound  has  the  same  scaling  in  i?  as  t  ^  oo  as  the  bound  (7.13).  Note  that 
jS  and  K  are  subject  to  the  constraint  that  the  coefficient  of  the  last  term  of  (7.23) 
be  nonnegative  and,  hence,  are  not  independent.  The  task  of  optimizing  the  choice 
of  p  and  K  in  the  prefactor  of  (7.27)  is  fairly  complicated  and  gives  little  additional 
information.  However,  we  conclude  by  observing  that  this  prefactor  blows  up  as 
K  approaches  unity,  which  will  be  the  case  as  approaches  the  hyperbolic 

boundaries  of  the  region  (7.16). 


8.  Global  Sobolev  Bounds 

This  section  is  devoted  to  the  derivation  of  global  bounds  on  all  spatial  deriva¬ 
tives  of  the  global  smooth  solutions  found  in  the  last  section.  As  a  consequence 
we  will  infer  that  the  CGL  equation  possesses  a  compact  global  attractor  when 
considered  as  a  dynamical  system  over  H’^(T^).  It  will  be  assumed  throughout  that 
cr  is  a  positive  integer,  so  that  the  local  existence  of  solutions  is  ensured  by 
Theorem  5.3,  i.e.  that  the  global  classical  solutions  are  in  fact  C'°®((0,  oo)  x  T^). 
This  does  not  yet  give  us  uniform  global  bounds  on  norms  of  the  derivatives  (al¬ 
though  some  bounds  might  be  derived  by  a  subtle  analysis  of  the  bootstrapping 
arguments  of  Section  5).  Here  we  explain  how  to  directly  control  the  norms  of  the 
Sobolev  spaces  ^^^(T^^)  and  how  to  use  those  bounds  to  obtain  global  estimates  on 
the  analytic  regularity  of  the  solution. 

Bartuccelli  et  al  [3]  first  estimated  the  norms  of  solutions  to  the  cubic 
CGL  equation  by  constructing  a  “ladder” — a  sequence  of  nonlinear  functionals, 
each  of  which  controls  successively  higher  derivatives  of  u,  and  that  are  connected 
by  differential  inequalities.  The  method  was  subsequently  applied  to  the  generalized 
CGL  equation  [25]  and  the  two-dimensional  incompressible  Navier-Stokes  equation 
[23] .  Recent  work  has  shown  that  it  is  possible  to  find  a  differential  inequality  that 
yields  global  control  directly  from  any  of  the  global  estimates  of  Section  7, 
i.e.  for  any  n  without  the  need  for  intermediate  rungs  and  functionals  [4].  This 
form  of  the  estimate  is  presented  below. 

As  the  norm  of  u  is  already  controlled  by  (7.2),  it  is  sufficient  to  estimate 
the  LP  norm  of  Here  is  considered  to  be  a  symmetric  n-tensor  and 

(8.1)  |V^u|2  = 

where  the  dot  denotes  the  usual  tensor  inner  product  which  acts  on  a  symmetric 
fc-tensor  and  a  symmetric  Ltensor  by  contracting  over  min{fc,  1}  indices  to  give  a 
symmetric  \k  —  Z| -tensor. 
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OllUWO 


(8.2) 


A  direct  calculation 


II  -  It  ” 

-  Re^(l  +  in)  j  V^CIwp'^u)  •  V"w*  do^  . 
right  is  easily  estimated  using  the  Sobolev  inequality 

n  1 

WTjn^.W  ^ 


The  second  term  on 

(8.3)  ||V"«||^.  <  ||V"+iu||^  llt^ll^  . 


More  work  is  required  to  achieve  a  bound  on  the  nonlinear  term.  The  estimate  we 
will  use  is 


(8.4) 


V"«*  dx 


2ap 

<«I|V"+'«||L+c(5)||V"«||L  IMI^rL 


In  this  formula  (which  will  be  proved  later  as  Lemma  8.3)  we  can  choose  S  >  0  and 
p  >  ad  to  suit  our  purpose.  In  particular,  by  setting  6  =  we  can  cancel 

the  first  term  with  half  of  the  dissipative  term  of  equation  (8.2)  to  obtain 


(8.5) 


£ 

dt 


1^  < 
Il2  ^ 


2-R  4"  c 


2ap 


l|V"«|| 


||Vn^||2("+l)/'* 


IL2 


L2 


L2 


One  can  use  the  a  priori  bound  (7.2)  to  control  the  denominator  of  the  last 
term.  It  is  then  clear  that  global  control  with  p  >  ad — note  that  this  is  precisely 
the  condition  for  local  regularity  from  data — ^is  sufficient  to  achieve  global 

control  for  any  n.  Explicit  bounds  can  be  obtained  by  following  the  procedure  in 
the  appendix,  although  in  general  it  will  not  be  possible  to  integrate  the  differential 
inequality  exactly.  Hence  we  state  the  result  in  its  simplest  form. 


Theorem  8.1.  For  all  positive  integers  d,  a  and  n  and  for  every  p  >  ad, 
when  pL  and  u  lie  in  the  region  of  the  parameter  plane  where  uniform  bounds  on 
solutions  of  the  CGL  equation  with  initial  data  exist,  these  solutions  are  global 
and  smooth  for  all  positive  times  with  their  norm  bounded  uniformly  for  all 
initial  data  and  t  ^  oo,  satisfying  the  differential  inequality  (8.5). 

Remark  8.1.  Estimate  (8.5)  does  not  give  the  best  scaling  in  all  cases,  al¬ 
though  known  improvements  are  slight.  For  a  more  detailed  discussion  see  [4]. 

Remark  8.2.  Nonlinear  dissipation  enters  estimate  (8.5)  only  implicitly  via 
the  and  possibly  the  bound.  The  explicit  dependence  on  the  sign  of  the 
nonlinear  term  has  been  lost  in  step  (8.4).  This  in  particular  shows  that  by  the 
same  method  global  bounds,  although  not  necessarily  uniform  in  time,  can  be 
obtained  for  equations  of  the  type 

(8.6)  dtu  =  Ru  -f-  (1  -h  iu)  Au  —  ip,  \u\^^u . 

Remark  8.3.  Theorem  8.1  shows  that  the  CGL  equation  possesses  a  compact 
global  attractor  when  considered  as  a  dynamical  system  over  any  (For 

purposes  of  comparison,  it  is  natural  to  consider  both  the  NLS  and  CGL  equa¬ 
tions  as  dynamical  systems  over  i7^(T^).)  Indeed,  because  i7’^+^(T^)  is  compactly 
embedded  in  the  existence  of  an  absorbing  ball  in  iif^'^^(T^),  which  is  in¬ 

ferred  from  (8.5)  and  necessarily  contains  the  attractor,  implies  the  compactness  of 
this  ball  in  i7^(T^).  Furthermore,  by  the  general  theory  developed  by  Mallet-Paret 
[58]  and  Mane  [60]  the  attractor  has  finite  Hausdorff  dimension  and  can  be  locally 


THE  COMPLEX  GINZBURG-LANDAU  EQUATION  AS  A  MODEL  PROBLEM  183 


embedded  in  finite  dimensional  manifolds.  This  theory  is  reviewed  in  [42] .  Explicit 
upper  bounds  for  the  Hausdorff  dimension  of  the  attractor  were  derived  for  d  =  1 
by  Doering,  Gibbon,  Holm  and  Nicolaenko  [24],  for  d  =  1,  2  by  Ghidaglia  and 
Heron  [35]  and  for  d  =  3  by  Bartuccelli  et  al  [3].  The  techniques  are  based  on  the 
estimation  of  global  Lyapunov  exponents  which  are  discussed  in  [13,  14,  78] . 

Remark  8.4.  In  some  cases  the  attractor  can  be  globally  embedded  in  an  in¬ 
ertial  manifold,  a  notion  introduced  by  Foias,  Sell  and  Temam  [29].  An  inertial 
manifold  is  a  finite  dimensional,  exponentially  attracting,  positively  invariant  mani¬ 
fold  that  contains  the  global  attractor — for  a  general  discussion  see  [29,  15,  16]  and 
references  therein.  For  d  =  1,2  the  CGL  equation  is  known  to  possess  an  inertial 
manifold  while  for  d  >  3  this  is  still  an  open  question.  Moreover,  its  dimension  is 
bounded  by  the  attractor  dimension  from  below,  while  upper  bounds  can  be  proved 
following  [16,  24].  This  procedure  relies  on  finding  a  large  enough  gap  in  the  spec¬ 
trum  of  the  Laplacian,  leading  to  a  much  larger  estimate  on  the  dimension  of  the 
manifold  when  d  =  2  and  failing  for  d  =  3  as  there  are  no  arbitrarily  large  gaps 
in  the  spectrum.  By  contrast,  it  is  not  clear  whether  the  Navier-Stokes  equations 
in  two  spatial  dimensions  possess  an  inertial  manifold.  This  question  remains  an 
active  area  of  research. 

To  prove  estimate  (8.4),  we  first  derive  a  preliminary  estimate  in  which  we 
impose  that  the  Gagliardo-Nirenberg  inequalities  be  used  at  their  margin  of  validity, 
thus  rendering  the  estimate  independent  of  the  dimension  d.  This  result  will  then 
be  used  in  two  different  ways  to  complete  the  proof. 

Lemma  8.2.  For  all  positive  integers  d,  a  and  n  the  following  inequality  holds: 


I  dx  <  c{n,  q)  \\V”u\\l,,  ||, 


where 


^  <  oo . 


Remark  8.5.  The  advantage  of  this  result,  derived  in  [69,  4],  in  comparison 
with  earlier  estimates  [3,  25]  is  that  it  permits  a  free  choice  of  r  while  keeping  the 
estimate  independent  of  d. 

Proof.  By  moving  the  absolute  value  inside  the  integral,  performing  a  Leibniz 
expansion  on  V^(|up^u),  and  then  using  the  triangle  inequality,  one  obtains 

/j  „  2a+l 

aeN-'+i  •’  ^=1 

|aj=n 

Here  we  have  again  employed  the  multi-index  notation  in  which  a!  and  \a\  are 
defined  as  in  (6.2).  For  a  given  multi- index  a  with  \a\  —  n  define  Oj  and  qj  by 


(8.10) 

These  satisfy 


Za^l  ,  I  2(7+1  - 

^  +  53- 

h  ^  2g  5, 
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Then  the  corresponding  term  on  the  right  side  of  (8.9)  can  be  estimated  with  a 
Holder  inequality  followed  by  a  Gagliardo-Nirenberg  interpolation  on  each  of  the 
factors; 

/  2(7+1  2<7+1 

|V"k|  n  |V“^M|d®<||V"u||^„  n 


=  c\\V^n\\l,A\u\\%, 


Finally,  by  setting  this  into  (8.9),  we  obtain  the  stated  result.  □ 

This  result  will  now  be  used  to  establish  estimate  (8.4),  thereby  completing  the 
proof  of  Theorem  8.1. 

Lemma  8.3.  For  all  positive  integers  d,  a  and  n  and  for  every  p  >  ad  and 
every  6  >  0,  one  has  the  estimate 

2a  p 

(8.13)  y  •  V"u’  dx  <6  ||V"+in||L  +  <6)  ||  V”«||L  ■ 

Proof.  Estimate  the  right  side  of  (8.7)  with  a  Gagliardo-Nirenberg  inequality, 
followed  by  Young’s  inequality; 


(8.14) 


jy  V"(|«P"u)  ■  V"u‘  dx  <  Cl  ||V"+iu||f/  ||V"«||^/'''’  \\u\\%,^ 

<  ^  l|V”+'u||L  +C2(^)  ||V"u||L  ll«ll!^  • 


For  the  inequalities  to  be  valid,  we  must  have  2-  d/r  >  0.  By  setting  p  =  2ar,  we 
obtain  the  statement  of  Lemma  8.3. 

If  d  =  1  however,  the  condition  r  >  1  from  Lemma  8.2  would  restrict  the 
possible  choices  of  p  even  further,  so  that  for  ^  <  r  <  1  we  must  complement  the 
proof  with  a  different  interpolation  in  order  to  obtain  the  result  in  its  full  generality. 
Thus  we  estimate 


y  d”{\u\'^’^u)d’^u*  dx  <  C3  ||a"u||f„c  ||9’ 


2(1-A) 


<  Ci{\\d^+^u\\’l^  ll«llL“”) 

(8. 15)  ,  .  _ - . 

where  a  and  b  are  Gagliardo-Nirenberg  exponents  computed  in  the  usual  way.  By 
the  choice  of  A  1/r  -  1,  this  equation  reduces  exactly  to  (8.14)  so  that  the  proof 
can  proceed  as  before.  ^ 

Remark  8.6.  The  suggestive  grouping  of  terms  in  equation  (8.15)  does  not 
show  a  lower  order  term  that  has  to  be  included  in  the  estimate  of  \\u\\l2<x  because 
u  is  not  a  mean-zero  function.  However,  in  this  case  it  is  possible  to  absorb  the 
extra  term  into  the  constant  C4.  For  details  see  [4]. 
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Remark  8.7.  The  ladder  inequalities  introduced  in  [3]  and  generalized  in  [25] 
use  functionals  of  the  form 

(8.16)  Fn.m=/  + 

Jrd 

the  choice  of  which  is  motivated  by  the  observation  that  \u\^^  scales  dimensionally 
like  A  in  the  CGL  equation.  Terms  like  and  \u\^^u  have  the  same  weight 
under  this  dimensional  scaling.  This  however  does  not  give  any  advantage  over  the 
direct  approach  as  presented  here. 

Remark  8.8.  We  remark  that  although  the  implications  of  Theorem  8.1  add 
nothing  so  far  as  the  existence  of  global  smooth  solutions  is  concerned,  the  explicit 
estimates  that  have  been  derived  are  useful.  Some  of  these  estimates  are  discussed 
in  detail  in  references  [3,  4]. 

Finally  we  will  use  the  global  Sobolev  bounds  of  this  section  to  initialize  the 
local  Gevrey  class  estimate  of  Section  6.  Recall  that  the  differential  inequality 

(6.17)  shows  that  for  every  t  for  which  u{t)  G  there  exists  a  positive  finite 

time  Tjnax  such  that  for  every  T  G  [0,  T^ax] 

(8.17)  \\A^u(t  +  T)||^  <  2  P’-u(i)||o  <  2  ||u(t)||^. . 

We  know  from  Theorem  8.1  that  ||w||//r  is  bounded  uniformly  over  all  initial  data. 
One  therefore  has  the  following. 

Theorem  8.4.  For  positive  integers  d,  a  and  for  every  r  >  dji  and  p  >  ad, 
when  fi  and  v  lie  in  the  region  of  the  parameter  plane  where  uniform  bounds  on 
solutions  of  the  CGL  equation  with  initial  data  exist,  these  solutions  are  global 
and  real  analytic  for  all  positive  times.  Furthermore,  there  is  a  T^ax  such  that  the 
V{e^^:  iir^(T^))  norm  of  u{t)  is  bounded  uniformly  over  all  initial  data,  for  every 
T  G  [0,  Tjnax]  and  for  every  time  t>T. 

Theorem  8.4  implies  that  the  Fourier  modes  of  a  solution  to  the  CGL  equation 
decay  exponentially  with  the  wavenumber  $  with  a  rate  of  decay  which  is  uniform 
over  large  times.  Doelman  [21]  first  observed  that  the  Fourier  coefficients  decay 
faster  than  any  algebraic  order.  The  exponential  decay  rate  was  established  later 
in  [22]  by  using  methods  similar  to  the  ones  presented  in  Section  6.  This  is  con¬ 
sistent  with  numerical  observations  as  well  as  with  theoretical  results  concerning 
notions  of  finite-dimensionality  which  indicate  that  CGL  dynamics  is  relatively  low 
dimensional. 

Prom  the  computational  viewpoint  the  exponential  decay  of  the  Fourier  coeffi¬ 
cients  means  that  simple  Fourier- Galerkin  truncation  works  well.  The  CGL  dynam¬ 
ics  is  captured  by  a  manageable  number  of  Fourier  modes  [22,  26].  Furthermore, 
more  sophisticated  nonlinear  Galerkin  methods  offer  no  advantage  over  Fourier- 
Galerkin  truncation  [46],  a  fact  that  is  intrinsically  linked  to  analytic  regularity 
and  the  exponential  decay  of  the  Fourier  coefficients. 

A  general  question  of  current  interest  is  whether  the  solution  of  a  dynamical 
system  can  be  determined  by  the  time  history  of  a  finite  set  of  data.  The  notion 
of  determining  modes  was  introduced  by  Foias  and  Prodi  [28]  in  the  context  of  the 
two  dimensional  incompressible  Navier-Stokes  equations.  A  set  of  Fourier  modes 
is  called  determining  if  the  knowledge  of  the  time  history  of  the  corresponding 
Fourier  coefficients  completely  specifies  the  solution.  In  other  words,  if  the  deter¬ 
mining  modes  of  any  two  solutions  agree  then  the  solutions  are  identical.  This 
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idea  was  subsequently  extended  to  more  general  determining  degrees  of  freedom — 
for  example,  the  solution  evaluated  at  nodes  or  averaged  over  cells  [31,  11].  It  is 
important  to  note  that  the  dynamics  of  the  full  equation  is  not  necessarily  repre¬ 
sented  by  the  restricted  dynamics  on  a  set  of  determining  modes.  Hence,  a  low 
number  of  determining  degrees  of  freedom  has  no  direct  implication  for  computa¬ 
tional  considerations.  However,  when  an  inertial  manifold  exists  (as  for  CGL  when 
d  =  1,2),  a  finite  number  of  Fourier  modes  usually  serve  as  its  coordinates.  The 
infinite  dimensional  dynamics  of  the  remaining  Fourier  modes  is  therefore  slaved  by 
the  dynamics  of  a  finite  dimensional  set.  In  such  cases  one  can  generally  establish 
bounds  on  the  number  of  determining  degrees  of  freedom  in  terms  of  the  dimension 
of  the  inertial  manifold  [11].  Such  bounds  can  be  far  from  optimal.  Indeed,  the 
CGL  equation  can  exhibit  a  very  low  number  of  determining  degrees  of  freedom. 
Kukavica  [50]  showed  that  for  d  =  a  =  1  the  periodic  CGL  has  sets  of  determining 
nodes  consisting  of  only  two  points.  His  proof  readily  extends  to  the  case  where 
a  is  any  positive  integer.  Analytic  regularity  is  a  key  ingredient  in  the  argument. 
There  is  now  a  considerable  effort  to  extend  and  sharpen  these  results. 

Appendix.  On  the  Integration  of  Differential  Inequalities 

Throughout  Sections  7  and  8  we  infer  uniform  bounds  for  any  nonnegative 
quantity  F{t)  that  satisfies  a  differential  inequality  of  the  general  form 

(A.1) 

where  s  >  0  and  b{t)  >  0.  Following  Bernoulli,  introduce  the  variable  Y  = 
into  (A.l)  and  obtain  the  linear  differential  inequality 

(A.2)  ^  >  -s  CL{t)  Y  +  s  b{t) . 

Now  multiply  this  equation  by  exp(J^^/  sa{t")  dt")  to  obtain 

(A.3)  ^ 

Integration  of  (A.3)  between  0  and  t  results  in 

(A.4)  y(t)  >  y(0)  exp^— y*  sa{t')dt'^+  j  sb{t')  exp^—  sa{t'')  dt"^dt' . 

The  uniform  upper  bounds  on  F{t)  follow  directly  by  neglecting  the  Y (0)  term, 
which  effectively  means  taking  the  supremum  over  all  initial  conditions  F(0): 

(A.5)  F{t)  <  sb{t')  exp(^- sa{t")  dt"^dt' 

This  bound  is  satisfied  by  any  nonnegative  quantity  F{t)  that  satisfies  the  differ¬ 
ential  inequality  (A.l),  no  matter  what  its  initial  value  happens  to  be. 
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as  a  Modulation  Equation 
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1  Introduction 

The  complex  Ginzburg-Landau  equation  is  a  generic  amplitude  or  modulation  equa¬ 
tion  to  describe  many  physical  systems  with  dissipation  near  the  onset  of  instability, 
see  [NW69,  DES71]  and  [NPL93]  for  a  recent  review.  In  typical  situations,  such 
as  the  Rayleigh-Benard  or  the  Taylor-Couette  problem  in  hydrodynamics,  one  is 
concerned  with  a  translationally  invariant  problem  where  the  wavelength  of  the 
dominating  pattern  is  much  smaller  than  the  size  of  the  physical  domain.  In  such 
situations  it  is  advantageous  to  study  the  system  on  an  infinite  physical  domain; 
this  leads  to  new  phenomena  like  sideband  instabilities  which  have  their  counterpart 
in  mathematical  difficulties  due  to  continuous  spectra  and  noncompactness. 

The  main  idea  of  the  Ginzburg-Landau  formalism  is  to  describe  slow  spatial 
and  temporal  modulations  of  a  basic  periodic  pattern  by  the  solutions  of  a  simple 
partial  differential  equation  for  the  amplitude  function  A;  this  PDE  for  A  is  called 
the  amplitude  or  modulation  equation.  It  is  our  aim  to  present  an  introduction  to 
this  theory  by  considering  the  most  simple  example  giving  rise  to  this  sort  of  pattern 
formation.  Consider  the  following  parabolic  PDE,  which  is  a  mixture  of  the  Swift- 
Hohenberg,  the  Korteweg-de  Vries,  and  the  Kuramoto-Shivashinsky  equation: 

Ut  =  -(1  +  dl)^u  -h  (a  -h  l3dl)uj:  +  s^u  +  f{u)  +  g{u)ux  for  t  >  0,  x  e  M.  (1.1) 
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Here  u~u[t^x)  E  JR  is  the  state  variable,  and  /  and  g  are  smooth  functions  with 
f{u)  =  f2U^  +  +  0{u^)  and  g{u)  =  giu  +  g2U^  +  0{\u\^)  for  w  — >  0.  The 

parameter  e  is  small  and  measures  the  distance  from  criticality.  Linearizing 
at  u  =  0  shows  that  the  periodic  patterns  are  damped  for  |1  —  >  e  and 

they  are  unstable  in  the  other  case  with  growth  rates  less  than  The  aim  of 
the  Ginzburg-Landau  approximation  is  to  describe  the  nonlinear  evolution  of  these 
weakly  unstable  periodic  patterns.  For  this  purpose  solutions  are  sought  in  the 
form 


UX{t,x)=eA{T,X)e*^‘^^+^'> +C.C.,  with  T  =  sH  and  X  =  £{x  +  Cgt), 

where  A  is  the  slowly  varying  amplitude  of  the  pattern,  T  is  the  slow  time  scale,  and 
X  the  large  spatial  scale.  Formal  multiple  scaling  calculations,  which  are  explained 
in  Section  2,  show  that  the  associated  modulation  equation  for  A  is  the  complex 
Ginzburg-Landau  equation 

drA  =  adlA  -{-bA-  c\A\^A,  (1.2) 

where  the  coefficients  a,  6,  c  G  €  can  be  calculated  explicitly  from  (1.1). 

In  Section  3  we  discuss  the  question  whether  the  function  u{t,x)  =  U\{t)  is 
really  an  approximation  of  a  true  solution  u  of  (1.1).  The  theory  involves  two 
major  steps.  First,  we  show  that  a  suitable  approximate  solution  u  U\  has  a 
small  residual 

Res(u)  =  dtu  —  L{dx)u  —  A/’(£,  tt) 

in  a  suitable  function  space.  In  the  next  step,  a  differential  equation  for  the  error 
R~u  —  uis  established  in  the  form 

dtR  —  L{dx)RADuJJ'{e,u)R-\-M{€,u,R)  +Res(iI),  (1.3) 

where  M  —  (!?(||R|p).  From  (1.3)  the  error  has  to  be  controlled  by  using  some 
kind  of  Gronwall  inequality. 

The  second  step  goes  through  in  a  straight  forward  way  for  cases  where  the 
nonlinearity  starts  with  cubic  terms.  However,  the  case  of  quadratic  nonlinearities 
needs  a  more  refined  theory  involving  the  structure  of  Fourier  mode  interaction.  To 
this  end  we  define  mode  projections  to  split  u  into  Uc  +  Ug  where  Ug  contains  stable 
modes  and  Uc  the  critical  modes: 

Uc  =  PcU,  where  P{Pcu){k)  =  k  E  JR^ 

where  T  is  the  Fourier  transform  and  Xc(^)  =  1  for  |1  —  |A:||  <  1/4  and  0  elsewhere. 
In  order  to  handle  the  case  of  quadratic  nonlinearities  it  is  essential  to  use  the 
fact  that  the  quadratic  interactions  of  critical  modes  lead  to  stable  modes,  i.e. 
Pc{PcU  •  Pcv)  =  0.  The  main  result  is  contained  in  Theorem  3.6,  and  it  states 
that  for  each  solution  A  —  A{T)  of  (1.2)  existing  for  t  E  [0,To]  and  each  d  >  0 
there  axe  Sq.D  >  0  such  that  for  all  e  E  (0,eo)  the  following  is  true:  the  solution 
u{t)  =  S^{uo)  of  (1.1)  with  11^0  -  ^  exists  for  t  E  [0,To/e^]  and 

satisfies 
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on  this  interval. 

The  whole  mathematical  theory  of  modulation  problems  depends  very  subtle 
on  the  choice  of  the  underlying  function  spaces.  In  Section  3  of  this  paper  we  use 
H^{]R)  as  the  basic  phase  space  for  (1.1)  in  order  to  present  the  theory  as  simple 
as  possible.  Other  choices  are  spaces  of  bounded  continuous  functions  or 
which  were  used  in  [CoE90,  KSM92,  Sch94a,  Co94],  and  function  spaces,  which 
contain  analytic  functions  having  a  bounded  holomorphic  extension  into  a  complex 
strip  around  the  real  axis  (see  [vHa91]).  As  Fourier  transform  methods  play  an 
essential  role  it  is  also  convenient  to  characterize  the  functions  through  its  Fourier 
transform,  e.g.,  in  [Eck93]  the  functions  u  with  Tu  G  L^{1R)  fl  L^{]R)  were  used. 

We  prefer  the  spaces  iif/^^(iR)  which  were  introduced  in  [Sch94b,  Sch94c,  MS94]; 
they  consist  of  the  uniformly  local  Sobolev  spaces  H^{IR)  =  W^’^(iR)  and  contain 
all  kind  of  bounded  functions  such  as  periodic  or  quasiperiodic  spatial  patterns  as 
well  as  travelling  fronts.  Moreover,  they  are  based  on  and,  hence,  the  important 
tool  of  Fourier  transformation  is  still  available.  Another  nice  property  is  that  the 
global  existence  of  solutions  in  these  spaces  can  be  shown  for  many  classical  dissi¬ 
pative  system  by  the  weighted  energy  estimates.  This  theory  is  recalled  in  Section 
4.1.  Moreover,  we  give  a  global  existence  proof  for  (1.2)  with  Rec  >  0  in  Theorem 
4.1  and  derive  explicit  bounds  for  an  absorbing  ball  in  ^/^(JR). 

Furthermore,  Section  4  contains  a  brief  account  of  further  research  which  leads 
to  a  more  detailed  study  of  the  system.  In  Section  4.2  we  discuss  the  attractivity 
of  the  set  of  modulated  periodic  patterns.  It  was  first  observed  in  [Eck93],  that 
the  dynamics  of  the  system  is  such  that  all  small  solutions  develop  after  a  certain 
time  the  structure  of  a  modulated  periodic  pattern.  This  property  is  very  similar 
to  the  attractivity  of  finite  dimensional  center  manifolds;  of  course,  in  this  case  the 
existence  of  an  invariant  manifold  is  not  known  and  seems  very  unlikely. 

The  approximation  property  derived  in  Section  3  and  the  attractivity  property 
allow  us  to  follow  all  small  solutions  u  for  alH  G  [0,  oo)  by  approximate  solutions 
of  Ginzburg-Landau  type.  However,  every  approximate  solutions  is  only  valid  over 
a  long  but  finite  time  interval;  after  that  interval  the  error  between  the  present 
approximation  and  the  true  solution  may  become  large,  but  the  attractivity  tells 
us,  that  there  is  another  modulated  periodic  pattern  which  is  close  to  u.  Using  this 
new  modulated  pattern  as  a  starting  point  for  the  Ginzburg-Landau  approximation 
we  again  obtain  an  interval  of  good  approximation.  Thus,  it  is  possible  to  construct 
a  pseudo-orbit  for  (1.2)  which  shadows  the  true  solution  u  for  all  t  >  0.  In  [Sch94c] 
it  was  shown  that  the  jumps  in  the  pseudo-orbit  can  be  controlled  and  are  less  than 
in  the  ^-independent  Ginzburg-Landau  equation. 

A  recent  result  in  [MS94]  concerns  the  existence  of  an  attractor  for  (1.1)  and 
of  an  attractor  Ag  for  (1.2)  in  the  space  Note  that  these  attractors  cannot 

be  compact,  as  they  are  translationally  invariant,  nevertheless  they  are  attractors 
in  the  norm  topology  of  The  existence  theory  is  based  on  ideas  of  [Fei93]. 

Moreover,  it  is  possible  to  compare  the  attractor  with  Ag  in  a  certain  scaled 
way. 

In  this  brief  introduction  many  results  in  the  field  of  modulation  equations 
are  not  touched;  here  we  mention  a  few  of  them  in  order  to  show  the  richness 
of  the  theory.  Hyperbolic  problems  also  show  modulational  effects  and  the  most 


194 


ALEXANDER  MIELKE  AND  GUIDO  SCHNEIDER 


famous  example  is  the  nonlinear  Schrodinger  equation  which  describes  the  nonlinear 
evolution  of  the  amplitude  of  a  small  wave  packet  in  dispersive  hyperbolic  systems. 
See  [CaE87]  for  formal  derivations  and  [Kal89,  KSM92,  Sch94e]  for  a  proof  of  the 
approximation  property. 

Generalizations  of  the  above  theory  are  available  also  for  vector-valued  prob¬ 
lems  and  for  systems  on  cylindrical  domains  one  unbounded  spatial  direction.  The 
stationary  or  time— periodic  Navier-Stokes  equation  on  an  infinitely  long  cylinder 
was  treated  in  [IMD89,  IM91]  and  the  stationary  part  of  the  Ginzburg-Landau 
equation  was  obtained.  In  [Sch94b,  Bol94]  the  full  parabolic  system  was  treated 
and  the  Ginzburg-Landau  equation  was  shown  to  produce  valid  approximations 
for  the  Navier-Stokes  equations.  Recall  that  the  theory  of  modulation  equations 
was  originally  invented  to  analyze  exactly  these  problems  in  hydrodynamics,  see 
[NW69,  DES71]. 

Modulation  equations  can  also  be  treated  on  domains  with  two  unbounded 
spatial  directions.  In  these  systems  one  typically  observes  an  competition  between 
roll-like  structures  and  hexagonal  patterns;  we  refer  to  [NPL93]  for  a  physical 
review  on  this  topic.  In  [Sch94d]  the  Newell-Whitehead  equation  was  studied  as  a 
modulation  equation  in  the  sense  described  above.  However,  it  turns  out,  that  it 
generally  fails  to  describe  the  dynamics  of  the  original  problem  on  a  suitably  long 
time  scale. 

This  negative  example  gives  new  motivation  for  the  mathematical  study  of  the 
connections  between  weakly  nonlinear  systems  and  their  modulation  equations.  Not 
every  equation  derived  formally  is  equally  well  suitable  to  describe  the  full  system. 
The  mathematical  justification  is  one  tool  which  helps  us  to  decide  which  modula¬ 
tion  equation  is  useful.  In  building  up  a  catalogue  of  qualifying  and  disqualifying 
properties  of  modulation  equations  it  will  be  possible  to  support  the  process  of 
modelling  the  effects  in  pattern  forming  systems;  thus,  we  can  replace  expensive 
physical  or  numerical  experiments,  which  by  now  are  the  standard  tools  to  test  the 
validity  of  modulation  equations. 


2  The  formal  derivation 

In  this  work  we  focus  our  attention  to  one  of  the  simplest  model  problems  which 
can  be  treated  with  the  Ginzburg-Landau  formalism: 

Ut  =  -(1  +  dl)^u  -h  (a  +  pdl)ua:  +  e^u  -h  f{u)  -f-  g{u)u^  for  t  >  0,  x£  M.  (2.1) 

Here  u  =  u{t,  x)^  JR  is  the  state  variable,  and  /  and  g  are  smooth  functions  with 
f{u)  =  f2U^  +  hu^  +  0{u^)  and  g{u)  =  gm  -h  g2u‘^  +  <^(1^1^)  for  u  ->  0.  The 
parameter  £  is  considered  to  be  small,  such  that  measures  the  distance  from 
the  onset  of  instability.  Note  that  the  parameter  a  can  be  changed  easily  by  going 
into  a  moving  frame:  define  u(t,x)  —  u{t^x  —  ct),  then  u  satisfies  (2.1)  but  with  a 
replaced  by  a  -h  c.  We  will  make  use  of  this  fact  below. 

The  basic  solution  is  u  =  0  and  its  stability  can  be  studied  by  analyzing  the 
linearization 

Vi  =  dlfv  +  (a  +  I3dl)v^  -h 
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As  this  problem  is  independent  of  x  and  t  it  can  be  solved  by  exponential  solutions 
in  the  form  v{t,x)  —  where  k  e  M  and 

X{k)  ^ —{1  —  {a  —  l3k‘^)ik £^. 

The  exponential  growth  rate  is  given  by  the  real  part  of  A  and  we  immediately  see 
that  growing  solutions  correspond  to  |A;|  C  (\/l  ~  \/l  +  e).  The  most  unstable 

wave  number  is  where  the  maximum  of  Re  A  is  attained,  in  our  case  k  =  ±1. 

The  imaginary  part  uj{k)  =  lmX{k)  gives  the  phase  velocity  of  the  spatially 
periodic  pattern  For  k  =  1  we  find  a;(l)  =  a  —  (3.  For  modulated  patterns, 
which  can  be  understood  as  wave  packets  with  slightly  varying  wave  number,  the 
group  velocity  Cg  is  important;  it  is  defined  as  =  dcjjdk.  For  the  critical  mode 
k  —  1  we  find  Cg  —  a— 3/3.  As  the  only  stable  wave  numbers  are  in  a  e-neighborhood 
of  —  ±1  we  let  A:  =  1  +  eK.  Now,  the  solutions  can  be  expanded  as  follows: 

v{t  x)  =  =  g^[(Q!“/3)t+x]+i£[(a-3/3)/Ct+/Ca:]+e^[l-(4-fi3/3)A^]i+C>(e®)i 

Hence,  by  choosing  the  frame  speed  such  that  the  group  velocity  is  0  (i.e.  a  =  3/3), 
the  solution  is  given  as  the  basic  periodic  pattern  modulated  on  the  slow 

time  scale  T  —  e^t  and  on  the  large  spatial  scale  X  =  sx: 

v{t,x)  =  e[i-(4+^3/5)A^+0(.)]TgiKXE^  ^here  E  = 

For  fixed  we  will  use  the  special  notation  to  indicate  the  time-independent 
function 

In  a  weakly  unstable  nonlinear  theory  it  can  be  expected  that  the  dynamics  is 
dominated  by  the  critical  modes  from  linear  theory.  The  interaction  of  these  modes 
is  driven  by  the  nonlinear  terms  and  occurs  through  modulations  on  the  slow  time 
scale  T  =  and  the  large  spatial  scale  X  =  ex.  From  now  on  we  always  assume 
that  the  group  velocity  of  the  critical  pattern  is  zero,  i.e.  a  =  3/3,  Hence  we  are  led 
to  the  ansatz 

u{t,  x)  =  U'X  {t,  x)  +  0{e^+^) ,  with  ,  , 

x)  =  eA{T,  X)E  +  sA(T,  X)E  +  ^^=2  Zt-n ^  ’  > 

Here  A  is  the  amplitude  function  for  the  basic  periodic  pattern  E  =  ■j'jjg 

amplitudes  =  An^-i  are  the  amplitudes  of  the  powers  of  the  basic  pattern 
generated  through  the  nonlinear  interactions. 

The  Ginzburg-Landau  formalism  proceeds  now  as  follows:  we  insert  the  ansatz 
(2.2)  into  the  basic  underlying  equation  (2.1)  and  equate  the  coefficients  of  the 
powers  of  e^E”^  to  zero.  After  finitely  many  steps  we  obtain  a  closed  problem, 
which  allows  us  to  determine  the  amplitude  A.  Moreover,  we  may  increase  m 

and  obtain  a  hierarchy  of  equations  from  which  allows  correction  terms  can  be 

constructed. 

In  order  to  show  how  this  procedure  works  we  rewrite  (2.1)  in  the  form 

Cu  ^J\f{e,u)  =  e^u-]-Af2{u,u)  +  A/'3(u,w,u)  +  (!l(||(u,Ua:)||^), 
where  Cu  =  Ut  —  L{dx)u  =  Uj  +  (1  +  dl)‘^u  +  /?(3  +  (9^)ux, 

J^2{u,v)  =  f2UV  +  ^(Ua,U  +Ut;a;), 

■  As (w,  U,  w)  =  fsUVW  +  ^  {uVWy;  +  UV^W  +  UxVw) . 


(2.3) 
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The  nonlinear  terms  A4  are  written  as  symmetric  kAmear  forms.  By  elementary 
calculations  we  obtain  the  following  result. 

Lemma  2.1 

Letme  and  B  =  B{T,  X)  with  T  =  and  X  ^  ex.  If  B  is  suIEciently  smooth, 
then 

£[B(£H,  ea:)E™]  =  -  Ao{m)]B  +  6iAo(m)Bx 

+e^[BT  +  ^KMBxx]  -  e^X^^\m)^%B-s^l,\^^\m)^%B}, 


where  Ao(/^)  =  -(1  —  k^)^  +  ^(3  -  k^)ik,  k  e  M. 

Note  that  2f3im  -  Ao(m)  =  0  if  and  only  if  m  =  ±1;  this  fact  will  be  important  for 
constructing  the  hierarchy  in  An,h 

Inserting  (2.2)  into  the  nonlinear  terms  we  find 

UT)  =  (/2  +  2\A\‘^  +  +  ffi  [iA^E'^  -  ^E-^j) 

J^3{Ua,  Ua,  Ua)  =  ((/3  +  *P2)^®e3  +  (3/3  +  ig2)\A\^AE  +  c.c.)  + 

where  =  (2/2  +  i£?i)(^^2,o  +^^2,2)-  iProm  C{AE)  =  0{£^)  it  is  obvious  that 
all  terms  of  order  e  vanish.  Comparing  the  terms  corresponding  to  e:^E^  yields 
{2ipk  -  Xo{k))A2,k  =  which  results  in 

(Oi  —  (— 1))A2,o  =  2/2!^^  +  5^1  •  0, 

{2ip  -  2i/?)^2,i  =  /2  •  0  +  5^1  •  0,  (2.5) 

(4z/3  -  (-9  -  2i/3))A2,2  =  (/2  +  i9i)A‘^. 

{An^i  with  /  <  0  is  always  given  by  the  complex  conjugate  of  An-h) 

We  are  now  ready  to  derive  the  associated  Ginzburg-Landau  equation  by  com¬ 
paring  the  coefficients  of  £:^E: 

At  +  Ax  X  +  ^Ao(1)^2,i  +  (2^/?  —  Ao(l))^3,i 

=  A-\-  (2/2  +  igi){AA2^o  +  ^^2,2)  +  (3/3  +  i92)\A\^A. 

Using  Ao(l)  =  2il3,  Cg  =  X'q{1)  ^  0,  and  the  previously  obtained  expressions  for 
A2,i  the  result  reads: 


At  =  (4  -h  Sil3)Axx  A  AA  cjApA, 

where  c  =  (2/2  +  i^i)(2/2  A  )  +  3/3  + 192- 


(2.6) 


Of  course,  we  can  write  out  all  the  equations  corresponding  to  e^E^  For  I  ^  ±1 
we  obtain  an  algebraic  equation  for  A^^i  in  terms  of  A  and  all  the  lower  order 
functions  with  n  K  ti.  For  /  =  il  and  ?2  ^  4  we  obtain  a  linear  partial 

n,l 

differential  equation  of  the  form 


1  =  (4  -h  3iP)dx An-2,1  +  (1  +  2c\A\'^)An-2,i  +  cA^^n-2,-1  +  Pa{T,X), 
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Here  pA  depends  only  on  A  and  A~j  (and  their  derivatives)  where  either  /  ^  ±1 

and  n<n  —  lorZ  —  ±1  and  n  <  n  —  2. 

Therefore,  we  are  able  to  calculate  arbitrarily  high  correction  terms  An^k,  as 
long  as  A  is  sufficiently  smooth.  Studying  the  exact  interactions  of  derivatives  and 
powers  in  e  we  arrive  at  the  following  result. 

Corollary  2.2 

Assume  that  A  :  [0,To]  x  JR  — »  €  has  bounded  continuous  partial  derivatives 
d^d^A  for  all  i,j  >0  with  2i  +j  <  m  >  2.  Then,  the  above  construction  yields 
an  approximate  solution  u  —  such  that 

Res(u)  —  Cu  —  Af{e,  u)  = 

uniformly  for  (t,x)  €  [0,To/£^]  x  JR. 

Thus,  we  have  found  approximate  solutions  in  the  sense  that  the  residual  for 
the  original  problem  is  as  small  as  we  like.  It  remains  to  show  that  small  residuals 
also  imply  small  errors,  which  is  done  using  the  boundedness  of  the  associated 
linearized  semigroup.  Using  the  ansatz  u  —  we  can  derive  a  differential 

equation  for  the  scaled  error  R.  As  has  size  of  order  e  the  error  scaling 
will  satisfy  1  <  m  <  m  +  1.  If  we  are  able  to  show  that  R  remains  of  order  1  for 
t  C  [0,  To/e^],  the  desired  result  is  established.  We  ask  for  a  time  scale  of  order 
l/e^,  which  is  absolutely  necessary  to  make  any  sense  out  of  the  result.  Recall  that 
the  modulations  of  A  appear  on  an  order  1  time  scale  in  the  slow  time  T.  To  see 
this  eflPect  in  the  approximate  solution  U'^  it  is  necessary  to  follow  the  solution  in 
the  unsealed  time  t  over  an  interval  of  length  Toje'^. 

3  Error  estimates 

As  mentioned  above  we  can  find  approximate  solutions  Ua  such  that  the  residual 
is  arbitrary  small.  The  error  r  between  this  approximation  and  a  suitable  true 
solution  u  should  be  small.  Letting  r  =  u  —  Ua  =  e’^R  we  obtain  for  the  error  the 
equation 

CR  =  (V(e,  Ua  +  e^R)  -  M{e,  Ua)  -  Res(t/^)) 

=  DuM{e,  UaH,  x))R  +  M{e,  UA{t,  x),R)-  £-"Res(t/^), 

where  M{e,u,R)  =  e~'^{N{e,u e^R)  —  M{e,u)  ~  DuM{£,u)e^R)  —  0{e^\R\^) 
uniformly  for  e  €  (0,1].  The  main  difficulty  lies  in  the  size  of  the  linear  part 
DuM{£,  Ua)^  since  it  allows  for  an  exponential  growth  rate  of  the  error.  As  Ua  = 
0(£)  we  have  DuAf{e,UA)  =  0{£^),  where  k  =  1  in  the  general  case  and  k  =  2 
whenever  A/2  vanishes.  The  latter  case  is  also  called  the  cubic  case,  as  the  nonlin¬ 
earity  starts  with  cubic  terms.  We  first  treat  this  cubic  case,  which  is  much  easier 
than  the  general  quadratic  case. 

In  order  to  build  a  mathematically  basis  for  this  theory  we  introduce  a  function 
space  Z  in  which  the  solutions  u  of  (2.1),  the  approximate  solutions  U4,  and  the 
error  in  (3.1)  will  be  considered.  Associated  to  this  is  a  function  space  Y  for  the 
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solutions  A  of  the  Ginzburg-Landau  equation  (2.6).  We  consider  the  mapping 
A  i->  C/4  as  a  mapping  from  the  Ginzburg-Landau  space  Y  into  the  phase  space  Z. 

We  choose  the  function  spaces  Z  =  H^{M)  and  Y  =  H^{}R)  0  where 

we  continue  to  distinguish  Z  and  Y,  sls  Z  contains  functions  depending  on  the 
unsealed  spatial  variable  x  ^  R  whereas  Y  contains  functions  on  the  slow  spatial 
variable  X  £  ]R.  For  convenience  we  also  use  the  abbreviations  Z^  ~  (M)  and 

=  H^{1R)  0  We  use  the  scaling  mapping  :  Y"^  Z^\A{-)  i-> 

A{s  ^ )  and  the  lift  Y  3  A^U\=^  e((W£^)E  +  (H^)E“^)  G  2',  which  satisfies  the 

||[/A||co<2£P||co,  ||C/i|U  <  2eV2||^||y. 

(We  continue  to  use  for  the  space  of  functions  on  JR  which  have  j  bounded 
continuous  derivatives,  where  ||  •  \\cj  is  the  sum  of  the  supremum  norms  of  these 
derivatives.)  The  F^-estimate  does  not  contain  the  coefficient  e  due  to  the  fact 
that  the  scaling  of  the  spatial  variable  X  =  ex  diminishes  the  decay  rate  in 

An  important  feature  of  the  theory  is  to  control  the  dominating  linear  operator 
C  =  dt-  L{da:)  with  L{dx)  =  -(1  +  +  I3{3  A  dDda^.  In  the  chosen  spaces  the 

following  result  holds: 

Lemma  3.1 

The  densely  defined,  closed  linear  operator  L{dx)  :  Z^  C  Z^  Z^  generates  a 
holomorphic  semigroup  S(t)  =  t  >  0,  which  satisfies,  for  all  m  >  n  >  0, 

the  estimates 


||5(t)||z’^^^n  <  1,  for  t  >0; 

WSmz^^zr-^  <  for  i  >  0. 

Proof:  The  result  follows  easily  by  Fourier  transform  T.  Letting  u  =  fFu  the  trans¬ 
formed  equation  reads  ut  =  Xo{k)u  which  has  the  solution  u{t,k)  =  fc). 

Using  the  fact  that  the  norm  in  Z^  is  equivalent  to  \\{l  +  k^)'^^^u{k)\\zo,  the  result 
is  easily  established  from  (1  +  |eAo(fc)t|  <  (1  +  ^ 

for  m  =  n  and  <  G^-n(l  +  for  m  >  n.  □ 


The  boundedness  of  the  linearized  flow  at  criticality  {e  =  0)  is  one  of  the  essential 
features  in  showing  that  approximate  solutions  17^  are  close  to  true  solutions;  it 
allows  us  to  turn  small  residuals  onto  small  errors.  We  can  now  rewrite  the  nonlinear 
evolution  problem  (2.1)  as  integral  equation 


u{t)  =  S{t)u{0)  A  [  S{t-  r)Af{s,  u{r))  dr. 
Jo 


Here  J\f{e,  •)  is  a  smooth  mapping  from  Z  into  Z^  =  L^{]R).  By  standard  semilinear 
parabolic  theory  (cf.  [HenSl])  we  obtain  a  local,  continuous  semigroup  SI  on  Z,  that 
is,  u{t)  =  ('u(O))  solves  (2.1).  Note  that  the  longtime  existence  of  the  solutions  is 

not  clear,  since  standard  theories  involving  Gronwall  estimates  guarantee  existence 
of  <?t^(uo)  only  for  t  £  [0, cmin{l/e^,  1/p}]  for  all  uq  with  ||t^olU  —  P’ 

Similarly,  the  Ginzburg-Landau  equation  can  be  treated  in  order  to  obtain  the 
local  semigroup  Qt  on  Y ,  such  that  A{T)  —  Qt{A(I)  solves  (2.6).  The  question 
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of  justification  of  the  Ginzburg-Landau  formalism  is  now  concerned  with  estimat¬ 
ing  the  difference  between  the  true  solution  u{t)  —  ^ii^Ao)  approximate 

solution  u{t)  =  U\{t)  ~  e{{W£Ge^t{^o))^  +  c.c.). 

3.1  The  case  of  a  cubic  nonlinearity 

Since  the  case  with  the  cubic  nonlinearity  serves  as  an  introductory  part  we  make 
the  further  assumption  that  the  nonlinearity  in  (2.1)  is  given  solely  by  /(w),  that 
is  g  =  0.  This  leads  to  the  simplification  that  Af  is  a,  smooth  mapping  from  Z  into 
itself  rather  than  the  larger  space  Z^. 

Theorem  3.2 
(Cubic  nonlinearity) 

Let  /2  =  /"(0)/2  =  0  and  =  0.  Let  A{0)  e  H^{M)  =Y^  cY  be  such  that  A  e 
C^([0,  To],  y^)  for  some  Tq  >  0.  Then,  for  every  d  >  0  there  exist  €o  >  0  and  D  >  0 
such  that  for  all  e  C  (0,£o)  ^he  following  statement  holds:  Let  u(t)  =  5j^(u(0))  be 
a  solution  of  (2.1)  such  that  ||u(0)  —  U\{0)\\z  <  then  u  exists  on  the  whole 

interval  [0,To/e^]  and  the  estimate 

||w(t)  -  U\{t)\\z  <  for  all  t  e  [0, To/e\  (3.2) 


is  satisfied. 

Remark:  Note  that  u{t)  =  U\{t),  and  hence  u,  are  of  order  e  in  L°°(iR)  and  of 
order  in  Z.  Thus,  the  error  of  order  is  relatively  small. 

Here  we  have  assumed  more  smoothness  on  the  initial  condition  A(0)  than  is 
actually  needed.  We  will  improve  on  this  when  we  consider  the  case  of  general 
nonlinearities. 

Proof:  We  want  to  show  that  the  error  u{t)  ~  U\{t)  remains  of  order  over 

the  time  interval  [0,To/e^].  However,  substituting  U\  into  (2.1)  leaves  the  residual 
terms  of  order  e.g.  e^f^A^E^.  Integrating  such  a  term  over  [0,To/e^],  leads  to 
an  error  0{e).  To  avoid  this  difficulty  we  follow  the  ideas  of  [KSM92]  and  use  the 
improved  approximation  u{t)  —  U%  which  is  given  by 


U^{t,  x)  —  eAE  +  E^(yl3^3E^  +  A3^iE)  +  C.C., 

where  ^43^3  =  /3/(64+24z/?)A^  and  A34  =  0.  As  the  coefficient  A^^x  will  not  appear 
in  the  residual  up  to  order  we  may  set  it  equal  to  zero  rather  than  taking  the 
correct  value  ^3,1. 

From  the  construction  and  the  considerations  in  Section  2  we  find  that  all 
residual  terms  of  order  s-f  with  =  1, 2, 3  cancel.  The  remaining  terms  are  at  least 
of  order  and  are  functions  of  dl^d^A  with  U  k  <  4.  Prom  the  assumption 
A(0)  G  y^  we  know  that  A  e  CO([0,To],y^)  fl  Cn[0,To],y2).  Thus,  the  residual 
can  be  estimated  by 

||Res(£r3(t))||2  < 

where  again  a  factor  is  lost  by  scaling. 
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The  true  solution  will  be  written  as  u{t)  =  u(t)  +  such  that  R  satisfies 

||/?(0)||z  <d  +  -  U^miz  =  d  +  0{s)  <  2d 

for  €  G  (0,  ei]  for  some  £i  >  0.  Moreover,  R  satisfies  the  differential  equation 

CR  =  DuAf{s,  Ul{t))R  +  M{s,  R(t))  +  p^it),  (3.3) 

where  pe{t)  =  -£-^/^Res(U^(t))  with  \\p,{t)\\z  <  for  t  e  [O.To/e^]  and  e<ei. 
The  linear  operator  DuA/"  ■.  Z  ^  Z  is  given  by  i?  i— »  (e^  +  f'{U\))R  and  can  be 
estimated  by 

\\DJ/ie,Ul)\\z-.z  <  2|k"  +  f'{Ul)\y  <  for  e  <  £i. 

The  factor  in  this  estimate  is  essential  for  the  following  theory;  it  is  due  to 
the  fact  that  the  nonlinearity  is  cubic,  that  is  f'{u)  —  0{u^).  In  addition,  the 
nonlinearity  M  allows  for  a  good  estimate,  since  the  second  derivative  is  still 
of  order  0{\\U%\\co)  =  0(e).  In  particular,  for  each  D  >  0  there  is  a  constant 
€2(0)  such  that 

\\M(£,U\(t),R)\\z  <  s^^‘^C2{D)  for  all  R  with  \\R\\z  <  D  and  all  e  <  Si. 


The  error  satisfies  the  integral  equation 


R(t)  -  S(t)R(iS)  + 


f  S{t-T)  fz)„V{e,  Ul{r)R{T)  +M{6,  f7l(r),i?(r))  +Pe(r)] 
Jo 


dr. 


Using  Lemma  3.1  and  the  estimates  derived  above  and  assuming  that  the  solution 
R  does  not  leave  the  ball  of  radius  D  in  Z,  we  obtain 


P(t)||z<2d  +  /o^||^(t- 


'^JWz^z 


\\BuJJ'\\z^z\\R('^)\\z  +  ||AH-Pe||z 
1 

dr 


dr 


<2d  +  /o‘  c[c^e^RiT)\\z  +  s^/^C^iD)  + 

<2d  +  CTo  [e^/^C2{D)  +  Cp]  +  C£^Ci\\R{t) 


Izdr. 


Gronwall’s  inequality  yields 

\\R(t)\\z  <  (2d  +  <  a{e,D)e^^^'^^ . 

We  now  choose  the  radius  D  of  the  ball  in  Z  to  be  equal  to  2q;(0,  and 

then,  £0  is  chosen  such  that  a(e,D)  <  2q:(0,0).  Now,  R{t)  cannot  leave  the  ball 
of  radius  D  for  t  e  [0,ro/£^]  and  estimate  (3.2)  is  established.  Obviously,  the 
solution  R  exists  and  is  unique  by  standard  local  existence  results.  This  proves 
that  w(t)  =  U\(t)  +  £^^^R(t)  exists  on  the  whole  interval  [0,To/£^].  □ 


3.2  The  case  of  quadratic  nonlinearities 

We  now  want  to  study  the  general  case  where  Af  is  not  restricted  to  start  with  cubic 
terms.  The  problem  with  the  previous  approach  is  that  the  derivative  DuAJ'ie^  ^7a(0) 


THE  COMPLEX  GINZBURG-LANDAU  EQUATION  AS  A  MODULATION  EQN.  201 

is  only  of  order  e;  this  would  only  allow  for  estimates  on  a  time  interval  [0,To/e] 
which  is  too  small  to  see  interesting  modulations. 

The  new  idea  for  handling  quadratic  nonlinearities  is  to  use  the  mode  structures 
more  explicitly.  By  quadratic  interaction  of  the  modes  and  we  generate  the 
modes  E^  with  I  £  {n  m,n  --  m,—n  m,~n  —  m}.  In  particular,  the  critical 
modes  with  n  =  ±1  generate,  through  quadratic  interactions,  only  modes  E^  with 
/  G  {—2,0,2},  but  all  these  modes  are  uncritical,  that  is  they  are  exponentially 
damped.  Thus,  we  split  the  solutions  and  the  error  into  two  parts,  one  consisting 
of  modes  close  to  k  =  ±1  and  the  other  contains  all  the  remaining  modes.  This 
splitting  is  affected  by  mode  projections  defined  in  Fourier  space.  We  let  x(^)  = 
1  for  G  /c  =  [—5/4, —3/4]  U  [3/4, 5/4]  and  zero  elsewhere,  and  we  define  the 
projections 

Pc  :  Z  ^  Z;u  Pg  :  Z  ^  Z]u  u  —  PcU. 

We  shortly  write  u  =  Uc-\-  Ug  with  Uc  =  PcU  the  critical  part  and  Ug  —  PgU  the 
stable  part.  Obviously,  these  projections  commute  with  the  differential  operator 
L{dx)  and  the  following  results  hold. 

Lemma  3.3 

a}  For  aUm>n>0  and  allt>0  we  have 

\\PsS{t)\\z^^Z^  <  Crr,-„{1  +  (-(— »)/4)e-‘/6^  ||Pc|U-^Z™  <  Cm-„. 

b)  For  all  u,v  £  Z  we  have  Pc[{Pcu)  •  (Pc^)]  =  0,  where  U'V  £  Z  means  ordinary 
pointwise  multiplication. 

Remarks: 

1.  The  first  estimate  shows  that  the  stable  modes  are  strictly  exponentially 
damped. 

2.  The  second  estimate  in  a)  means  that  the  critical  mode  projection  is  a 
smoothing  operator:  the  functions  in  PcZ  are  in  fact  holomorphic  functions. 

3.  The  last  assertion  exactly  is  the  manifestation  of  the  fact  that  quadratic 
interactions  of  critical  modes  are  not  critical  any  more. 

Proof:  For  part  (a)  we  proceed  as  in  Lemma  3.1  but  now  the  supremum  of  the 
multiplier  (l+A:2^(m-n)/2g-(i-A:  )  t  jg  taken  over  the  relevant  intervals,  namely 
Ic  —  [—5/4,  —3/4]  U  [3/4, 5/5]  and  Ig  =  ]R\  Ic,  respectively. 

Part  (b)  follows  immediately  from  the  fact  that  the  product  in  a:-space  is  trans¬ 
formed  to  a  convolution  in  Fourier  space.  Hence  T{PcU  •  Pcv)  has  its  support  in 
Ic  +  Ic  —  [—5/2,  —3/2]  U  [—1/2, 1/2]  U  [3/2, 5/2],  and  Ic  H  [Ic  +  /c)  =  0  implies  the 
result.  □ 


Since  we  are  now  able  to  study  the  critical  components  separately  from  the 
stable  ones  we  can  use  the  damping  in  the  stable  components  to  compensate  for 
lower  powers  in  e  in  the  error  equation.  In  order  to  estimate  u{t)  —  U\{t)  we 
introduce  an  intermediate  approximation 

U\{t)  =  e[PoW,A)^  +  s\PoW,A2,2)E^  +  ^I{PoW,A2,o)  +  c.c. 
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where  is  the  scaling  operator  Z^]A  i-^  A{e-)^  and  Pq  : 

Z‘^\u  1-^  is  ^  mode  projection  corresponding  to  the  zero  mode,  namely 

=  1  for  |/c|  <  1/12  and  0  elsewhere.  The  amplitudes  ^2,2  and  ^2,0  are  given 
in  (2.5);  note  that  ^2,1  is  not  included  as  it  will  not  contribute  to  the  relevant 
terms. 

The  estimation  of  the  residual  consists  in  giving  rigorous  bounds  on  the  terms 
which  where  left  over  after  equating  the  lowest  order  terms  to  zero.  These  remaining 
terms  possess  only  formally  the  magnitude  e'^  and  have  to  be  controlled  in  our 
function  space  To  this  end  it  is  necessary  to  use  the  smoothing  properties  of 
the  solutions  A  of  the  Ginzburg-Landau  equation:  starting  with  ^(0)  G  T  =  = 

H^{M)  implies 

P(T)|lyn+i  +  l|arA(T)||yn-i  <  Gn(l  +  T--/"),  for  T  G  (0,ro].  (3.4) 

Moreover,  we  have  to  control  the  scaling  properties  of  IV^A,  which  can  only  be 
measured  by  taking  derivatives:  dxiWsA)  =  sWs{dxA^  is  one  order  smaller  than 
WeA  itself.  Additionally  it  is  important  that  the  scaling  concentrates  the  modes  in 
Fourier  space  around  the  wave  number  zero,  such  that  application  of  the  projection 
Po  does  not  change  a  scaled  function  too  much. 

Lemma  3.4 

For  each  n  ^  Nq  there  exist  constants  Cn  such  that 

n 

\\W,A-PoWeA\\zn  \\W,A\\zr^ 


for  all  A  e 

Proof:  The  first  result  is  obtained  by  Fourier  transform: 

\\W,A  -  PoWeA\\%r.  <  Cjj,  1(1  +  |fcr)(l  -  Xoik))^,{PA){!l)\'  dk 
<Cfj^  \2il2re”{l  +  \k\”){TA)il)\' dk 

=  C(12)2-e2n-i  1(1  +  \K\^){PA){Kf  dK  <  C{\2f^e'^^-^\\A\\l^, 

where  l-Xo(^)  =  0  ^r  \k\  <  1/12  was  used  essentially.  The  second  estimate  follows 
easily  as  each  spatial  derivative  of  the  scaled  function  produces  a  factor  e.  □ 


The  treatment  of  the  quadratic  nonlinearity  follows  the  analysis  in  [Sch94a]; 
however,  we  use  different  function  spaces,  and  we  relax  the  smoothness  assumption 
on  A(0).  As  seen  above,  it  is  necessary  to  use  spatial  derivatives  in  order  to  gain 
smallness  which  is  expressed  in  powers  of  £.  But  at  the  same  time  the  finite  smooth¬ 
ness  of  A{T)  for  T  =  0  will  generate  temporal  singularities  in  the  residual  through 
the  terms  ^JA(T).  The  residual  estimate  given  below  provides  exact  control  on 
the  temporal  singularities  and  the  associated  powers  of  e. 

Theorem  3.5  _ 

Let  A  =  A{T)  e  C®([0,  To],  F),  U\,  and  U\  as  above.  Then,  the  following  estimates 
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hold:  _ 

Wlit)  -  U\{t)\\zn  <  +  t(i-)/2), 

4PcUj{t)\\zn  +  \\P,lP^{t)\\zn  <  CnS^/\ 

4PcUlmcn  +  \\PsUl{t)\\c^  <  Cne\ 

\\PcResiU\m\z’^  <  C7„£3/2(i  +j)-i, 

\\P,Res{Ul{t))\\zn  <  (7„eV2(i  +  f-i/2). 

Proof:  For  the  first  estimate  we  consider  which  is  defined  as  C/|  but  without 
the  projections  Pq.  Then, 

\\Ul  -  U\\\zr^  <  Cs2(||Weyl2,2||z>^  +  \mA2,o\M 

<C'e3/2(i_,_^n^^gn[i  +  (£2^)-(/-l)/2]^  <  C'e3/2 (1  +  t(l-n)/2), 

where  (3.4)  and  the  second  estimate  in  (3.5)  was  used.  The  difference  U\  —  U\ 
can  now  be  controlled  by  the  first  estimate  in  (3.5)  and  (3.4),  and  then  our  first 
estimate  is  established.  The  bounds  on  U\  follow  immediately  from 

WP^W.AWzr,  <  ||Poilzo^Z"l|t^e^lUo  <  Cne-^/^A\\zo. 

and  ||lTe74||co  =  The  first  estimate  in  (3.6)  is  a  consequence  of  (3.5),  (3.4), 

and  the  explicit  representations  of  U\  and  U\.  This  proves  (3.6). 

For  the  estimate  of  the  residual  we  write  U\  =  eui  +5^1x2,  where  ||ttj||ci  ^ 

^  and  Pui  has  support  in  [-13/12,-11/12]  U  [11/12,13/12]  and 

similarly  Pu2  has  support  in  the  three  intervals  of  radius  1/12  centered  around 
—2, 0,  and  2.  The  residual  has  the  expansion 

Res(t7l)  =  dtU\  -  L{d^)Ul  -  V(e,  Ul)  =  Resi  + 

with  Resi  =  edtUi  +  s^dtU2  —  eLui  —  e^Lu^  —  e^A/2(Mi,  wj) 

+A/2(wi,U2)  +A/3(ui,Ui,Ui))  = 


(3.6) 

(3.7) 


where  6-i  =  6i  and  supp.7^5;  C  [/  — 1/4,  ?  + 1/4] .  The  special  splitting  of  the  residual 
according  to  the  modes  in  Fourier  space  allows  for  an  easy  distinction  between  the 
stable  and  the  critical  part,  namely  PcResi  =  6i.  (This  was  the  reason  why 

the  support  of  Xo  was  chosen  as  [—1/12, 1/12].) 

We  immediately  see  To  estimate  62  we  proceed  as  follows: 


62  =  e^dt{PoWeA2,2)^^  -  L(a^)[(PoW,A2,2)E2]  -  e‘^Af2{PoWeAE,PoWeAE) 


=  [s^PoW,dTA2,2  +  £^((4f/3  -  Ao(2))PoH^e^2,2  -  M2is)A2,2) 
-  [f2{PoWeA)^  +  giPoW,A{iPoWeA  +  ePo{W,dxA))]E^, 


where  M2{e)  :Y^  E-2(Ao(2)  -  L{da:))  [{PoWsB)E^]  .  Using  the  defini¬ 

tion  of  i42,2  from  (2.5)  and  (3.5)  we  find 


||<^2||zo  <  /VsH  e‘^\\M2{s)\\Yi^z^^ 

-^s^\f2Pi9i\\\{PoW,A)^-Po{WeAnzoPe^\gi\\\{PoWs^^^ 
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The  fourth  term  is  of  order  uniformly  for  t  6  [0,  Tq/s'^].  The  third  term  can  be 
estimated  with  (3.5)  and 

WiPoB)'^  -  Po{B'^)\\zo  <  \\{PoBf  -  B^zo  +  \\B^  -  Po{B^)\\zo 

<  \\PoB  +  Bl|co||PoB  -  B\\zo  +  ||Fo(52)  -  B^zo', 

and  we  again  obtain  The  linear  operator  M2(£)  is  controlled  via  Fourier 

transform: 

||M2(£)5|||„  =  4  (Ao(2)-Ao(2  +  fc))^(^5)(fc/£)  \k 

<  C\k\/£{PB){k/e)f  dk  =  C^£\\dxB\  <  C^e\\B\\l,. 

Exactly  the  same  procedure  can  be  used  to  estimate  6o  and  the  desired  first  estimate 
in  (3.7)  is  shown. 

The  estimation  of  the  critical  part  is  a  little  more  tiresome  but  follows  the  same 
scheme: 

6i  =  £^{PoWedTA)B  -  e^{-\'i{i)PoWedlAl2  +  PoW^A)^  +  e[Mi(e).4]E 

-£3  {2M2{PoWeAE,  PoWe^.oE")  +  JV2(PoM4^2.2E3,  PoWeAE-'^) 

+  ^MsiPoW^AE,  PqW^AE,  PoWeAE-^^ , 


where  Mi  (£)P=E-'  (Ao(1)+|A[;(1)52-L(5^))[(Po1F^P)E].  Inserting  the  Ginzburg- 
Landau  equation  to  eliminate  the  leading  linear  part  we  obtain  a  nonlinear  term 
which  is  close  to  the  other  nonlinear  terms,  such  that  the  difference  can  be  esti¬ 
mated  as  above,  but  now  with  the  bound  For  the  operator  M\{e)  : 

we  proceed  as  follows 

\\M\(e)B\WQ  —  (Ao(l  + /c)  —  Ao(l)  -  Ao(l)/c  -  Ao(l)^)^^^(-^^)(^/^) 

<  C\k\P/£{J^B){k/£)fdk  =  ^llyp, 

where  p  =  1, 2,  or  3.  Thus,  we  find 


^3/2  ^5/2  ^3/2 

£||[Mi(£)yl{T)]E||zo  <  C7£min{£'/3, 


and  the  second  estimate  in  (3.7)  is  established  by  noting  e’!'^  <  Tq£'^/‘^I{\  +  t)  on 
the  relevant  time  interval.  ^ 


Theorem  3.6 
(General  nonlinearity) 

Let  Aq  G  H^{]R)  —Y  be  such  that  A{T)  —  Gt{A(})  satisfies  A^C  ([0,To],T)  for 
some  To  >  0.  Then,  for  every  d  >  0  there  exist  £o  >  0  and  C  >  0  such  that  for  all 
£  G  (0,  £q)  the  following  statement  holds: 

Let  u{t)  =  Sf{u{0))  be  a  solution  of  (2.1)  such  that  ||u(0)  -  U\{t)\\z  <  dz  >  , 
then  u(t)  =  S^(u(0))  exists  on  the  whole  interval  [0,To/e:^]  and  the  estimate 

||Pe[u(f)  -  Uimiz  <  Ce^'^  -  PlitMz  <  G(£3/3  +£5/4g-t/6)^  (3.8) 

for  all  t  G  [0,  To/e^],  is  satisfied. 
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Remarks:  Note  that  the  initial  condition  w(0)  is  only  close  to  the  first  order 
approximation  U\{0),  but  the  result  states  closeness  of  the  solution  to  the  second 
order  approximation  such  that  the  error  in  the  stable  part  is  even  one  order  smaller. 
This  is  a  first  result  in  the  direction  of  attractivity,  which  means  errors  are  damped 
out. 

It  is  clear  from  the  proof  below  that  the  result  of  the  theorem  can  be  changed 
in  such  a  way  that  can  be  replaced  by  with  f3  €  (1,3/2),  or 

Proof;  We  write  the  solution  u~u{t)  of  the  full  system  in  the  form 

u  =  euc  +  e^Us  +  (3.9) 

where  Uc  =  e~^PcU%  Ug  =  e~‘^PsU%  Rc  =  PcRc,  and  Rg  —  PsRs^  By  construction 
we  know  that  Uc  and  Ug  are  uniformly  bounded  in  C^{IR)  for  t  G  [0,  To/e^],  thus  we 
will  estimate  them  henceforth  by  C. 

Using  Theorem  3.5  it  is  sufficient  to  show  that  Rg  and  Rc  are  bounded  uniformly 
in  e:  G  (0,e:o]  and  t  <  To/e^  in  the  appropriate  norms.  We  define 

K:{£,  Uc,  Ug,  Rc,  Rg)  =  M{£,  Ul  +  e^^^Rc  +  e^^‘^Rg)  -  M{e,  Ul)  -  Res(Ul), 

and  note  the  expansion 

Uc,  Ug,  Rc,  Rs)  -  e^^^'^iRc  +  eRg)  -  +  eug,  Rc  +  eRs) 

~e^f‘^N2{Rc  +  eRg,Rc-¥eRg)  +  Res{Ul)\\%  (3-10) 
<  Cos'"/"  [||uc  +  eug\\l,  +  s'/"Caa(||Rc  +  eRg\\z)]  \\Rc  +  eRg\\z, 

where  CV  is  some  continuous  functions  from  [0,  oo)  into  itself.  Inserting  (3.9)  into 
(2.1)  and  projecting  onto  the  critical  and  stable  part  we  obtain  the  coupled  system 

df,Rc  —  P{dx)Rc  "I"  ^c(^J  Rci  Rs)i  2 JI^S 

dtRs  =  P{^x)Rs  -\-  ICs{£,  Uc,  Ug,  Rc->Rs)i 

where  Kc  =  e-^I^PcK  and  Kg  =  e’^^I^PgK. 

Note  that  A/2(w, c)  =  /2Wi?  +  gi{uxV  +  uVx)/2  can  be  estimated  in  two  ways, 
namely 

\\cN2{u,v)\\zo  <  Cmin{||ti||ci||v||z,lkl|z||w|U}- 

We  will  always  use  the  C'-norm  for  Wc,s  but  the  Z-norm  for  the  errors  i?c,5. 

Using  (3.10),  Lemma  3.3(b)  and  Theorem  3.5  we  obtain  the  following  estimates 

||^c(*  •  ‘)\\z^  ^  ^||Rc||z  +  ll-Rsllz  +  ^'/"C/c(|iRc+^Rs||z)^  +  s'/"/(l  +  t), 

||/c,(. .  Olizo  <  c(||iic  +  eRsWz  +  £^^*Ck{\\Rc  +  eRsWz)  +  ^ . 

The  first  estimate  relies  heavily  on  the  cancellations  PcAf2  {uc,  Rc)  =  PcMi{Rcj  Rc)  = 

0. 

We  now  rewrite  (3.11)  with  the  variations  of  constants  formula  using  the  semi¬ 
group  S{t)  which  satisfies  the  estimates  in  Lemma  3.3(a).  By  the  assumption 
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on  the  initial  condition  and  the  first  estimate  in  (3.6)  (with  n  =  1)  we  know 
||i?c(0)  -\-eRs{0)\\z  <  C,  Assuming  \\Rc(t)  +  ei?s(i)ll  <  D  for  we  obtain 

WRcimz  <  C  +  (7^2  [||i?e(r)|U  +  \\Rs{r)\\z  +  e^/^C>c{D)  +  f^]  dr, 

PsWIU  <  Ce-‘/Ve  +  +  (t-r)-i/4)e-(‘-")/6 [||i?e(r)  +  eRs(T)\\z 

+  s^f^CK^iD)  +  £1/4(1  +  r-i/2)]  dT. 

The  stable  part  Rs  may  have  an  initial  condition  of  order  1/e. 

We  introduce  the  functions  rc,s{t)  =  max{  i|i?c,s(^)liz  :  r  G  [0,t]  }  and  obtain 
from  the  second  inequality  ||i^s(t)||  < 

0 

Thus,  we  obtain  rs{t)  <  je  +  %C[rc{t)  +  ers{t)  +  e^^^C]c{D)]  +  Ce^^^  which, 

for  e  <  17(12(7)  leads  to 

WRsimz  <  rs{t)  <  2Ce-‘/V£  +  12Cr,{t)  +  \lCe'^l\CK.{D)  +  1). 

This  result  can  be  inserted  into  the  first  estimate  to  give 

r-cW  <  C  +  C7£2  J*  [re(r)  +  e-Z^/e  +  e^/‘^CK(D)]dT  +  Ce^'^  1/(1  +  r)  dr 

<  Ce^  f*  rdr)  dr  +  C*  [e  +  e^/^  (ToC>c(D)  +  1  +  Tq  +  log(ro/£2))] , 

for  all  t  e  [0,ro/£^].  The  classical  Gronwall  estimate  yields  on  the  same  time 
interval 

rdt)  <  ac{s,  D)  and  rdt)  <  Cse~*Z^f£  +  Usie,  D), 

where  the  continuous  functions  ac,s  do  not  depend  on  D  when  e  is  set  equal  to  0. 

Now,  we  choose  D  =  2q;c(05  0)  +  2q;s(0,  0)  +  2(7s,  then,  there  is  a  Sq  such  that 
for  all  e  6  (0,60]  we  have  ade^D)  +  (7s  +  eas{e^D)  <  D,  Hence,  the  solutions 
Rc{t)+eRs{t),  t  G  [0,To/e:^]  cannot  leave  the  ball  of  radius  D  in  Z,  and  the  desired 
estimates  follow. 

We  may  refine  the  above  by  giving  estimates  in  smoother  spaces. 

Corollary  3.7 

The  estimate  (3.8)  can  be  generalized  to 

ll«W  -  Ul{t)\\zn^^  <  C„(l  +  i-”/4)£5/4, 

Wlit)  -  t/A(t)llz"-H  <  (7„(1  +i-"/")£®/2 

where  n  E:  JN  is  arbitrary. 

Proof:  This  result  is  obtained  the  same  way  as  in  the  proof  of  Theorem  3.6,  we 
only  have  to  estimate  Rc  and  Rg  in  .  Having  the  boundedness  of  Rc^s  in  if 
is  classical  linear  regularity  theory  for  parabolic  PDEs  (cf.  [HenSl])  to  derive  the 
estimates 

+  P.(l)|Un+.  <  C  (1  +  t-”Z*[\\RMh  +  ||i?.(0)||z])  . 


□ 


(3.12) 


THE  COMPLEX  GINZBURG-LANDAU  EQUATION  AS  A  MODULATION  EQN.  207 

Recall  that  the  nonlinearity  N  is  arbitrarily  sraooth  and  Res{U\{t))  is  bounded  in 
each  Z^.  Using  the  first  estimate  in  (3.6)  the  assertion  is  established.  □ 


In  general,  it  is  possible  to  use  U2{t),  n>  2  as  an  approximate  solution;  then, 
the  corresponding  residual  is  smaller:  ||Res(C/2(^))||^o  =  0{6'^).  In  such  situations 
it  is  possible  to  use  the  ansatz 

u  =  €Uc^  e^Us  + 

with  smaller  coefficients  in  front  of  the  errors  Rc,s  •  Employing  the  above  theory  it 
is  possible  to  derive  rigorous  error  estimates  of  the  form 

ll«(i) -UmWz  <  for  t  €  [0,To/e% 

where  ||u(0)  -  U^{Qi))\\z  <  is  assumed. 


4  Further  results 

4.1  Absorbing  balls  in  uniformly  local  function  spaces 

To  develop  the  theory  further  it  is  necessary  to  switch  to  more  appropriate  func¬ 
tion  spaces  which  have  nicer  properties.  First  of  all,  we  should  mention  that  it 
is  of  practical  interest  to  include  bounded  functions  into  the  analysis  rather  than 
functions  in  which  have  to  decay  at  infinity.  In  particular,  periodic  (or 

quasiperiodic)  functions  and  travelling  fronts  are  the  center  of  many  research  stud¬ 
ies  in  the  theory  of  pattern  formation  on  unbounded  domains.  But  there  is  also  a 
major  mathematical  reason  to  use  different  function  spaces,  namely  the  fact  that 
the  Ginzburg-Landau  equation 

drj^A  =  ad%A-\- bA  —  c\A\‘^A,  a^b,c€(0^  Rea>0,  (4.1) 

does  not  have  an  absorbing  ball  in  H^{]R).  This  can  be  seen  by  considering  the 
real  case  with  a,  6,  c  >  0  and  starting  with  any  real  initial  condition  Aq  G  {JR) 
and  Ao{X)  e  (0,  ^/b/c).  Then,  from  the  maximum  principle  we  conclude  0  < 
A{T,  X)  <  y/b/c,  and  moreover,  A{T,  X)  for  T  ^  oo  and  X  fixed.  It  can 

be  shown  that  ||A(T,  Ollyo  >  C\/T  for  large  T. 

Following  [MS94]  we  introduce  function  spaces  and  which  will  replace  the 
old  spaces  Z'^  and  The  new  spaces  contain  all  suitably  smooth  and  bounded 
functions  and  are  based  on  theory  such  that  Fourier  transform  methods  are  still 
available.  First  we  choose  a  positive  weight  function  p  G  C^(iR,  (0,  oo))  which  is 
bounded,  has  a  finite  integral  fj^p{x)dx,  and  satisfies  \p'{x)\  <  p{x)  for  all  x.  As 
a  consequence  we  obtain  p{x  Y  y)  <  e^^\p{x)  for  all  x,y.  (We  may  fix  p  once  and 
for  all  to  p{x)  =  l/cosh(a:)  or  p{x)  =  2/(2  -f-x^).)  Next  we  let 

w  ^  Aoc(^)  •  \W\\lJ  <oo})  with  ||w||^2  —sup  [  p{y  +  x)u{x)‘^dx, 

yeJRjjR 
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and  define  the  translation  operator  Ty  :  w  t— 'u(  •  +  ?/).  Our 

final  space  of  uniformly  local  functions  is  given  as 

Ll^{IR)  =  {ueLl,{lR)  :  \\TyU-u\\Li^-^0asy^0}, 

Note  that  different  weight  functions  lead  to  the  same  uniform  space  with  equivalent 
norms.  For  tl  G  we  define  the  associated  Sobolev  spaces  by  requiring 
that  the  first  n  distributional  derivatives  lie  in  The  condition  that  the 

translations  Ty  are  continuous  is  needed  to  guarantee  that  the  spaces 
neN,  are  dense  in  see  [MS94]  Lemma  3.1. 

We  now  define  the  corresponding  phase  spaces  Z'’^  =  ^r,u  functions  u 

and  the  phase  space  =  H^JM)  e  for  the  Ginzburg-Landau  equation. 

As  a  first  result  we  derive  an  upper  bound  for  the  radius  of  an  absorbing  set  for 
(4.1).  We  use  weighted  energy  norms  similar  to  [CoE90,  Co94],  however  we  provide 
explicit  bounds  in  terms  of  the  coefficients  a,  6,  c. 

Theorem  4.1 

Let  ar  =  Re  a,  W  =  Re  6,  and  =  Re  c  be  positive.  Then,  all  solutions  A  =  A(T,  •) 
of  (4.1)  with  A{0,  •)  e  exist  for  aJi  T  >  0  and  satisfy 

||^(r)ll|„  <  e-“^||>l(0)||I  +  (1  -  e-“^)Ao, 

limsup  ||axA(r)||i  <  (2(1  +  a  +  6Ao)  + 

T^oo  J  \ 

where  a  —  2br  +  |ap/ar,  Ao  =  Cpo/cr,  6  —  C'pmax{0,2|c|  —  4cr},  and  Cp  is  a 
constant  depending  only  on  the  weight  p. 

Proof:  We  proceed  as  for  standard  energy  estimates  but  use  weighted  norms. 
Partial  integration  then  involves  a  derivative  of  the  weight  function  p  which  can  be 
estimated  by  p  due  to  our  assumption. 

^  Jj^p\A\^dX  =  2-Rb  =  2Re  Jj^pA{aAxx  +  bA  -  c\A\^ A)  dX 

=  2Re  {-a !^{p'A  +  pAx)AxdX}  +2 §,^p{hr\A\^  -  Cr\A\*)dX. 

<  -Ur  ^j^p\Ax\'^dX  +  -  ')\A\^)dX, 

where  a  =  2hr  +  |a|^/ar  and  7  =  2cr-  Here  we  have  used  |p'(X)|  <  p{X)  for  all 
X  e  M  and 

2Rea  j  p'AAxdX  <  2|a|  J  p\A\\Ax\dX  <  ar  J  p\AxfdX  +  ^/ MfdX. 

With  a\A\‘^  -  'i\A\^  <  a(a  -  we  obtain  - 

a\A\^)  dX,  and  an  application  of  Gronwall’s  inequaUty  yields 

S^p{X)\A{T,X)\HX  <  e-“^4p(X)|^(0,X)|2dX  +  (1  -  6"“^)^  4p(X)dX 
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Since  this  estimate  is  also  true  when  p  is  replaced  by  the  translated  weight  Typ,  the 
first  estimate  is  proved. 

For  the  first  derivative  we  proceed  similarly  and  obtain 

^  J  p{a\Ax\^+2{\c\-2cr)\A\^\Ax\^'^  dX. 

(4.2) 

In  order  to  shorten  the  following  formulae  we  introduce  the  abbreviations  ej{T)  = 
\\A(T,  ■)||p.  and  rj{T)  =  for  j  =  0, 1,  and  2.  Prom  par¬ 

tial  integration  we  find  rf  <  ro(ri  -br2)  and  a  trivial  variant  of  Sobolev’s  embedding 
theorem  yields  ||A||^o  <  Cpeo{eo  +  ei).  With  these  stipulations  (4.2)  takes  the  form 

^^1  —  ~  {P  A  l)rj  +  {(5  +lA  a  +  Seo{eo  H-  ei))ro{ri  -f  r2) 

<  ~{P  -b  l)rf  +  s{t)ri  +  s^{t)/{Aar)  <  -/3r?  -b  (1  -f-  l/a^)s2(t)/4, 

where  /3  >  0  is  arbitrary,  6  =  2Cpmax{0,  ic|  -  2cr},  and  s{t)  =  (,3  -b  1  -b  a  + 
^eo(co  +  ei))ro.  Applying  Gronwall’s  inequality  and  using  the  same  estimate  for 
all  translated  weights  we  obtain 


el{T)  <  e-^^e?(0)  +  r,3+l-ba-b5[e§(r)+eo(r)ei(r)])  V 

<  e-^^ef(O)  +  ^  e-^(^--)([,3+l+a+^eg(r)]  V.52e2(r)ef(T))dr. 


Note  that  /?  is  still  arbitrary  in  the  above  estimate.  We  may  now  use  Lemma  4.2 
below  and  limsup^,^^  e§(r)  <  Aq  in  order  to  see  that  ei(T)  is  also  bounded  and 
satisfies 


limsupei(r)  < 

T-^oo 


+  1  +  a  +  ^Ao)‘ 


Setting  /5  —  1  +  a  +  <5Ao  +  we  obtain  the  desired  result.  □ 


Lemma  4.2 

Let  (i^v  E  C°(iR,  iR)  be  bounded  functions  with  ^{t)  >  0  for  all  t.  Assume  that 
13  >V  =  limsup^__,^  if[t)  and  that  the  continuous  function  p  €  C®(iR,  JR)  satisfies 


Pit)  <  p(0)e- 


-Pt 


f 


-«*-")[/r(r)  +  p(r)p(T)]dr, 


for  all  t  >0.  Then,  p  is  bounded  on  [0,  oo)  and  satisfies 


limsupp(t)  <  - — ^  limsup/i(r). 

t—^oo  P  r— »’00 


Proof:  We  let  w{t)  —  p{0)e  +  jje  +  i^p]  dr,  then  p{t)  <  w{t),  w{0)  = 

p(0),  and 


d 


dt 


w  = 


~ffw  +  //  +  l/p  <  (— /?  +  l/)w  +  p. 
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Prom  Gronwall’s  inequality  we  find  w{t)  <  p(0)e“(‘)  +  /o  where 

a{t)  =  Jo  ((^(r)  -  P)  dr.  Let  Ji  =  limsup^_«,  m(t)  and  take  e  >  0  with  V  +  e<p. 
Then,  there  is  a  time  to  such  that  ^(t)  <  Ji  +  s  and  i/(t)  <  n  +  £  for  t  >  to- 
With  n{t),v{t)  <  M  for  t  G  [0,to]  we  find  for  t  >  to  the  estimate  a{t)  -  a(to)  < 
-{l3-u-£){t-to)  axid 

w{t)  =  (p(0)e“(‘<>)  +  /o"  +  fl  dr 

<  (p(0)  +  +  fl  +  e). 

Thus,  we  have  limsup^^oo  w(t)  <  (p  +  e)/(/?  —  and  since  p{t)  <  'w{t)  and 

£  >  0  was  arbitrary  the  result  follows.  ^ 


In  the  uniformly  local  functions  spaces  the  method  of  Fourier  transform 

is  still  available,  namely  in  the  form  of  the  so-called  multiplier  theory.  An  operator 
^  is  called  a  multiplier  if  it  is  defined  by  multiplying  the  Fourier 
transform  u  =  /u  by  a  function  m  €  L^{1R,  (D)  and  then  doing  an  inverse  Fourier 
transform.  Using  the  following  lemma,  which  is  proved  in  [Sch94b],  allows  us  to 
study  the  mapping  properties  of  M  :  u  i— >  T~^{rnJ-u).  Natural  applications^are 
convolution  operators  Mu(x)  =  /j^  ra{x  —  y)u{y)  dy  with  m  integrable,  where  m  — 
Tm. 


Lemma  4.3  _ 

Let  q,selNo  and  w,-,(k)  =  (1  +  G  Ci(IR,  (T).  Then  M  :  if,,, 

Hi^\u  I-*  T~\rhTu)  is  well  dehned  with  the  estimate 


\\Mu\\h 


s 

l,u 


<  C'(9,s)|lu;s_,||c2(jR,(r)ll“llH’„. 


where  C{q,  s)  does  not  depend  on  in. 

Using  this  result  it  is  trivial  to  see  that  the  operator  G(T)  =  defines  a 

holomorphic  semigroup  on  each  Y”  and  similarly  S(t)  =  defines  a  holomor- 

phic  semigroup  on  each  Z".  Moreover,  the  analogues  of  the  estimates  m  Lemma 
3.1  hold: 


llG(r)lh 


^ym 


^  Cn—m{^  “I"  ^ 


—  (n— 1 


a)/4\ 

(4.3) 


where  n  <m. 

In  fact,  all  the  theory  of  Section  3  can  be  carried  through  in  these  spaces  with 
only  one  modification,  namely  the  definition  of  the  mode  projections  Pa,  Pc,  and 
Pq.  According  to  Lemma  4.3  we  have  to  use  smooth  cut-off  functions  in  order 
to  define  the  mode  separation:  we  choose  £  G  C^(iR,  [0, 1])  with  x(^)  =  1 
|fe|  <  1/5  and  0  for  \k\  >  1/4.  The  critical  part  is  now  separatee^ using  Ec,  which 
is  defined  via  EcU  =  E~'^{XcPu),  wher^£c(fc)  =  x(fc  +  1)  +  x(-l  “  ^)-  This 
operator  Ec  is  a  bounded  operator  from  Z°  into  for  each  n  G  JVq,  however,  it 
is  no  longer  a  projection.  Since  Ec  deletes  all  Fourier  modes  outside  the  interval 
[_5/4, —3/4]  u  [3/4, 5/4]  while  other  modes  are  kept  or  diminished,  the  operator 
Ec  is  called  a  mode  filter.  In  addition,  we  define  EaU  =  u-  EcU  and  the  splitting 
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u  —  Uc  Us  with  Uc  —  EcU.  Moreover,  we  need  the  mode  filter  Eq  which  is  given 
via  Equ  =  and  extracts  the  Fourier  modes  concentrated  around  0. 

When  all  Pc, 5,0  are  replaced  by  Pc,s,o,  respectively,  all  the  results  of  Section  3 
can  be  worked  out  completely  similar  as  in  the  L^~case.  We  refer  to  [Sch94c,  MS94] 
for  the  details  and  give  here  an  overview  on  some  further  results. 


4.2  The  attractivity  of  the  set  of  modulated  patterns 

Here  we  treat  the  question  why  modulated  waves  are  of  such  an  importance.  The 
Ginzburg-Landau  equation  is  formally  derived  to  describe  the  evolution  of  the  lin¬ 
early  unstable  modes.  In  weakly  nonlinear  theory  one  expects  that  these  modes 
dominate  the  dynamics  of  the  full  problem  under  arbitrary  initial  conditions.  In  the 
last  section  error  estimates  are  shown  under  the  assumption  that  the  initial  condi¬ 
tions  already  possess  the  scaled  mode  structure.  It  is  our  aim  to  explain  why  after 
the  time  h  —  Txje^  all  solutions  of  the  full  problem  develop  this  mode  structure 
and,  henceforth,  can  be  described  by  the  solutions  of  the  associated  Ginzburg- 
Landau  equation. 

The  first  result  in  this  direction  was  obtained  in  [Eck93].  For  our  choice  of 
function  spaces  the  result  is  given  in  [Sch94c].  We  note  here  that  all  the  results 
given  below  are  only  established  for  the  case  that  the  Ginzburg-Landau  equation 
(4.1)  has  real  coefficients,  a,  6,c  €  M.  However,  it  is  clear  that  all  statements  and 
proofs  can  easily  be  translated  into  the  present  context  of  the  complex  Ginzburg- 
Landau  equation. 

Since  multiple  scaling  analysis  is  only  a  local  theory  this  attractivity  can  only 
be  expected  for  initial  conditions  in  a  small  neighborhood  of  the  trivial  solution. 

Theorem  4.4 

(Attractivity  of  the  set  of  modulated  patterns) 

Fix  Cl  >  0,  then  there  exists  Co  >  0,  Sq  >  0,  and  Ti  >  0,  such  that  the  following  is 
true.  For  all  e  €  (0,£o)  and  uq  e  Z  with  ||no||^i  <  CiS  the  solution  u(t}  =  Sf(uo) 

exists  for  t  G  [O^Ti/e^]  and  there  is  an  Aq  e  with 

P0II9,  <  Co  and  +  (4.4) 

Note  that  can  be  replaced  by  the  function  Eq  as  the  constant  factor 

can  be  compensated  into  Aq.  Thus,  we  may  consider  the  set  of  modulated  patterns, 

MP  =  {£(W,^oEo  +  H^Eo^)  :  IIA0II91  <Go}, 

as  a  small  very  flat  ellipsoid  in  the  phase  spac^Z^,  and  MP  attracts  all  solution 
starting  in  the  ball  of  radius  CiS  around  0  €  Z\  at  least  up  to  an  error  of  order 
£3/2. 

Using  the  result  from  above  it  is  possible  to  apply  the  approximation  theory  of 
the  previous  section.  In  particular,  the  assumptions  of  Theorem  3.6  are  fulfilled 
when  n(0)  =  and  A(0)  =  Aq  is  chosen  and  Tq  >  0  is  any  time  such  that 

A{T)  =  Qt{Ao)  exists  for  T  G  [0,To]. 
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The  development  of  the  mode  structure  is  essentially  a  linear  effect,  and  for 
illustrative  purposes  we  establish  below  the  linear  analogue  of  Theorem  4.4.  We 
find  that  the  time  scale  0(l/£^)  is  necessary  for  the  solutions  w(t)  to  develop  the 
scaled  mode  structure.  For  the  nonlinear  theory  it  is  then  necessary  to  show  that  the 
solutions  of  the  nonlinear  problem  exist  over  the  desired  time  interval  t  E  [0,  Ti/e^]. 
For  general  problems  with  quadratic  nonlinearity,  solutions  of  order  e  exists  only 
on  the  time  scale  l/s,  but  again  the  fact  can  be  used  that  quadratic  interactions  of 
critical  modes  generate  damped  modes.  The  time  Ti  has  to  be  chosen  sufficiently 
small  to  avoid  blow-up  of  solutions,  which  may  occur  in  the  case  when  Re  c  <  0  in 
(4.1)  (see  below  for  the  longtime  existence  in  the  case  Rec  >  0). 

The  key  step  for  isolating  the  Ginzburg-Landau  mode  A  E  l^from  a  general 
function  w  E  Z  is  to  find  an  approximate  inverse  of  the  mapping  ^ 

Z^.  We  let 

Using  the  counterpart  of  Lemma  3.4  in  (see  [Sch94c])  we  have 

^  ^  e{We{He,t)u)Et  +  W,($(£,t)ii)E-i) 

The  operator  $(£,t)  :  ^  has  a  norm  of  order  and  a  function  u 

may  be  called  to  have  mode  structure  when  £||$(e,  Ikll^n-  The  following 

lemma  shows  that  the  linear  part  develops  the  desired  mode  structure  for  times 
t  >  Txje^  with  Ti  >  0. 

Lemma  4.5 

Let  S{t)  =  t  >  0,  then  for  each  n  e  IN  there  is  a  constant  such  that 

<^(l  +  ,n(tn/4\tn7^)- 

Hence  for  each  Tj  >  0  the  operator  s^(£,t)S(t)  :  Z°  9’'  is  bounded  indepen¬ 
dently  of  e  for  all  t  G  [Ti/e^,  oo). 

Proof:  Letting  A  =  $(e,t)S'(t)w  we  find  that  the  Fourier  transform  of  A  is  given 
by  J^A{K)  =  m{£,t,K){Tu)(\  -h  eK),  where 

m{e,  t,  K)  =  x(3(l  + 

Here  the  old  wave  number  k  is  replaced  by  1+  sK  due  to  the  shift  from  Et  and  the 
spatial  scaling.  The  factor  1  je  is  compensated  by  transforming  the  Fourier  integral 
from  k  to  K.  Hence,  The  operator  ^(e,  t)S{t)  can  be  written  as  the  composition  of 
M{e,t)N{e,t)  where  N{e,t)u  =  and  M  is  defined  by  the  multiplier 

associated  to  the  function  fh{£,t,K),  Using  Lemma  4.3  we  find 

WMieMyo^yn  ^  IRe.i.Olb  +  +  <"/')])• 

Moreover  ||-^(^j  ^)||^o_9o  ~  result  is  established.  D 

From  the  estimate  in  the  lemma  we  see  that  for  small  t  the  smoothing  of 
t)S{t)  occurs  through  the  fourth  order  operator  L{dx)  which  leads  to  the  weak 
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singularity  However,  this  smoothing  does  not  generate  the  properly  scaled 

modes.  The  mode  scaling  is  only  enforced  by  the  quadratic  part  of  Ao(l  +  e:i^)  and 
thus  is  associated  to  the  stronger  singularity 

4.3  Shadowing  by  pseudo-orbits 

Finally  we  consider  the  case  where  the  Ginzburg-Landau  equation  is  stable,  i.e., 
Cr  =  Rec  >  0  in  (4.1).  Prom  Theorem  4.1  we  know  that  the  solutions  of  the 
Ginzburg-Landau  equation  stay  inside  an  absorbing  ball  or  converge  exponentially 
to  it.  Hence,  we  may  use  the  approximation  property  as  well  as  the  attractivity  to 
control  any  solution  of  the  original  problem  (2.1)  for  arbitrarily  long  time  intervals. 
The  idea  is  to  make  a  first  step  to  find  the  mode  structure  u{ti)  =  U\^{ti)  +  0{e^) 
where  =  #(£,  ti)u(ti),  then  the  solution  can  be  described  over  a  time 

interval  +  Ti/s^]  by  the  Ginzburg-Landau  approximation  where  Ai 

solves  the  Ginzburg-Landau  equation  with  the  above  initial  condition.  At  the  end 
of  this  interval  the  error  between  the  true  solution  and  the  approximation  might  be 
large;  however,  from  the  attractivity  we  know  that  the  true  solution  u{t2)  is  close  to 
a  different  modulated  pattern  from  which  the  approximation  theory  may  start 
on  the  next  interval.  The  exponential  attractivity  of  the  absorbing  ball  controls  the 
size  of  the  solution  A  and  guarantees  that  A  stays  bounded  for  all  T  >  0. 

In  this  way  we  can  shadow  the  true  orbit  u(t)  =  Sf{uo)  for  all  ^  >  0  by  a 
sequence  of  solutions  of  the  Ginzburg-Landau  equation. 

Definition  4.6  ^ 

Let  Ti  >  0  and  k  >  0.  A  function  A  e  L^((0,  oo),  F^)  is  called  a  (Ti,  k) -pseudo- 
orbit  in  for  (4.1)  if  for  ail  n  ^  IN  the  relations 

A((n  -  l)Ti  +t)  -  gr(A((n  -  l)r))  for  all  r  e  [0,Ti), 

hold,  where  gr  is  the  semigroup  associated  with  (4.1)  and  A(T—0)  —  lim^ ^('^)- 
The  following  result  is  Theorem  3  of  [Sch94c]. 

Theorem  4.7 

(Shadowing  by  pseudo-orbits) 

Let  Rec  >  0  in  (4.1);  then  for  all  Ti  >  0  there  exist  positive  constants  Sq,  C,  and 
To  such  that  for  all  e  G  (0,  £o]  the  following  is  true: 

For  all  initial  conditions  uq  with  Httoll^i  ^  s  the  solution  u{t)  =  St{uo)  ex¬ 
ists  for  all  time  and  there  is  a  -pseudo-orbit  A  for  (4.1)  which  satisfies 

||A(0)||y  <  C  and  approximates  u(t)  as  follows: 

+  To/e")  -  U\ (t) II for  allt>0. 

This  shadowing  technique  can  be  used  to  show  thatjn  fact  all  solutions  starting 
in  a  small  but  6-independent  neighborhood  Z//  of  0  in  exist  for  all  time  and  are 
finally  absorbed  into  a  ball  of  radius  Ce.  This  result  is  interesting  in  so  far  as  it 
devices  a  new  way  to  construct  absorbing  sets,  namely  by  studying  the  dynamics 
rather  than  applying  energy  estimates  which  would  result  in  bounds  of  order  6^/^, 
see  the  discussion  in  [MS94]. 
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Theorem  4.8 
(Locally  absorbing  set) 

Let  (4.1)  be  the  modulation  equations  associated  to  (2.1)  such  that  >  0.  Then 
there  exist  positive  constants  Sq?  ^  such  that  for  all  e  E  (0,£o] 

lutions  u{t)  =  St{uo)  of  (2.1)  with  WuqW^i  <  So  exists  for  all  t  >  0  and  satisfy 
limsupt_^  \\u{t)\\^,  <  Ce. 

4.4  Comparison  of  attractors 

Using  the  weighted  energy  estimates  which  are  introduced  in  the  proof  of  Theory 
4.1  it  is  not  only  possible  to  construct  absorbing  balls  for  the  semigroups  Qt  on 
and  SI  on  but  we  are  also  able  to  find  attractors  in  the  following  sense.  We 
refer  to  [MS94]  for  the  following  two  results. 

Theorem  4.9 

(Existence  of  an  attractor) 

Let  Qt  be  the  nonlinear  semigroup  of  the  Ginzburg-Landau  equation  (4.j.)  with 
Cr  >  0  posed  in  FU  Then,  there  is  a  non-empty  closed  bounded  set  Ag  C  with 
the  following  properties: 

(i)  Ag  attracts  bounded  sets  in  i.e.,  for  any  bounded  B  C  F^  we  have 

distal  (GT(B),Ag}  =  sup  inf  WGri^)  -  a||-i  0  for  T  oo, 

(a)  Ag  is  time  and  translation  invariant,  Le.,  GriAg)  =  TyAg  =  Ag  for  all  T  > 

0,  y  G  M. 

(Hi)  A  is  localized  compact  in  the  sense  that  Ag  is  bounded  in  Y^  and  compact 
when  the  weighted  norm  \\A\\p  =  ( fj^p{\A\^  +  \A'\‘^)dX)  ^  is  used  in  Y^, 

The  proof  of  this  theorem  is  based  on  ideas  in  [Fei93].  Note  that  the  attractor  Ag 
is  nontrivial  as  it  contains  all  the  attractors  Ag  which  are  obtained  by  restricting 
(4.1)  ti  the  space  of  functions  with  period  f. 

Similarly  there  exists  attractors  A^  for  the  semigroups  Sf.  The  question  is  now 
as  to  how  well  the  attractors  A^  can  be  described  by  the  attractor  of  the  limit 
problem  which  is  the  Ginzburg-Landau  equation.  A  first  result  is  given  in  the 
following  theorem. 

Theorem  4.10 
(Comparison  of  attractors) 

For  every  a  >  0  there  exist  C,£o  >  0  such  that  for  all  £  E  (0,  £o]  the  estimates 
distrp ^ (e,  0) A^,  Ag)  <  a  and  dist^^(EsA^,{0})  < 


hold. 

This  result  means  that  A^  is  upper  semicontinuous  towards  the  attractor  Ag  of 
the  Ginzburg-Landau  equation  in  the  sense  that 

disty  ($(e,  0).4®,  Ag)  +  ^distz{EsA^,  {0})  ^0  for  e  ->  0. 
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Thus,  the  solutions  u  in  the  attractor  have  relatively  small  stable  parts  Us  —  EgU 
and  the  critical  part  Uc  is  given  approximately  by  a  Ginzburg-Landau  mode  U\{0) 
where  A  is  in  the  limit  attractor  Ag .  In  this  way  we  have  obtained  an  upper  bound 
on  the  richness  of  the  attractor  A^^  It  is  an  unsolved  problem  to  show  the  opposite 
direction,  namely  that  A^  is  also  as  rich  as  the  attractor  Ag^  which  means  a  lower 
bound  on  the  complexity  of  the  attractor.  In  mathematical  terms  this  means  lower 
semicontinuity  in  the  sense  disty (^gi,  $(5, 0)^^)  0  for  £  ^  0. 


References 

[B0I94]  P.  Bollerman.  Validity  of  the  Ginzburg-Landau  approximation  in  2- 
dimensional  Poiseuille  flow.  Manuscript  University  of  Utrecht,  1994. 

[CaE87]  F.  Calogero  and  W.  Eckhaus.  Nonlinear  evolution  equations,  rescal¬ 
ings,  model  PDEs  and  their  integrability:  I  +  II.  Inverse  Problems^  3  (1987) 
229-262;  and  4,  (1988)  11-33. 

[Co94]  P.  Collet.  Thermodynamic  limit  of  the  Ginzburg-Landau  equations.  Non¬ 
linearity  7  (1994)  1175-1190. 

[CoE90]  P.  Collet,  J.-P,  Eckmann,  The  time  dependent  amplitude  equation  for 
the  Swift-Hohenberg  problem.  Comm.  Math.  Phys.,  132  (1990)  139-153. 

[DES71]  R.C.  DiPrima,  W.  Eckhaus,  and  L.A.  Segel.  Non-linear  wave- 
number  interaction  in  near-critical  two-dimensional  flows.  J.  Fluid  Me¬ 
chanics,  49  (1971)  705-744. 

[Eck93]  W.  Eckhaus.  The  Ginzburg-Landau  equation  is  an  attractor.  J.  Nonlin¬ 
ear  Science,  3  (1993)  329-348. 

[Fei93]  E.  Feireisl.  Locally  compact  attractors  for  semilinear  damped  wave  equa¬ 
tions  on  R^.  Manuscript,  Institute  of  Math.  CSAV,  Prague,  1993. 

[Hen81]  D.  Henry.  Geometric  Theory  of  Semilinear  Parabolic  Equations.  Lecture 
Notes  in  Mathematics  Vol.  840,  Springer-Verlag  1981. 

[IM91]  G.  looss,  A.  Mielke.  Bifurcating  time-periodic  solutions  of  Navier- 
Stokes  equations  in  inflnite  cylinders.  J.  Nonlinear  Science,  1  (1991)  107- 
146. 

[IMD89]  G.  looss,  A.  Mielke,  Y.  Demay.  Theory  of  steady  Ginzburg-Landau 
equation,  in  hydrodynamic  stability  problems.  Europ.  J.  Mech.,  B /Fluids, 
8(3)  (1989)  229-268. 

[Kal89]  L.A.  Kalyeikin.  Long  wave  asymptotics,  integrable  equations  as  asymp¬ 
totic  limits  of  non-linear  systems.  Russian  Math.  Surveys,  44(1),  3-42, 
1989. 

[KSM92]  P.  Kirrmann,  G.  Schneider,  A.  Mielke.  The  validity  of  modula¬ 
tion  equations  for  extended  systems  with  cubic  nonlinearities.  Proc.  Royal 
Society  Edinburgh,  122 A  (1992)  85-91. 


216 


ALEXANDER  MIELKE  AND  GUIDO  SCHNEIDER 


[MS94]  A.  Mielke,  G.  Schneider.  Attractors  for  modulation  equations  on  un¬ 
bounded  domains  —  existence  and  comparison  .  IfAM— Preprint  Univer¬ 
sity  Hannover  1994.  Submitted  to  NonlineaHty. 

[NPL93]  A.C.  Newell,  T.  Passot,  J.  Lega.  Order  parameter  equations  for 
patterns.  Annual  Reviews  Fluid  Mech.,  25  (1993)  399-453. 

[NW69]  A,  Newell,  J.  Whitehead.  Finite  bandwidth,  finite  amplitude  convec¬ 
tion.  J.  Fluid  Mech.,  38  (1969)  279-303. 

[Sch94a]  G.  Schneider.  A  new  estimate  for  the  Ginzburg-Landau  approximation 
on  the  real  axis.  J.  Nonlinear  Science,  4  (1994)  23-34. 

[Sch94b]  G.  Schneider.  Error  estimates  for  the  Ginzburg-Landau  approximation. 
J.  Appl  Math.  Physics  (ZAMP),  45  (1994)  433-457. 

[Sch94c]  G.  Schneider.  Global  existence  via  Ginzburg-Landau  formalism  and 
pseudo-orbits  of  Ginzburg-Landau  approximations.  Comm.  Math.  Phys., 
164  (1994)  157-179. 

[Sch94d]  G.  Schneider.  Validity  and  limitation  of  the  Newell-Whitehead  equa¬ 
tion.  IfAM-Preprint  University  Hannover  1994.  Submitted  to  Math. 
Nachrichten. 

[Sch94e]  G.  Schneider.  Justification  of  modulation  equations  for  hyperbolic  sys¬ 
tems  via  normal  forms.  IfAM-Preprint  University  Hannover  1994.  Sub¬ 
mitted  to  Mathematische  Zeitschrift. 

[vHa91]  A.  van  Harten.  On  the  validity  of  Ginzburg-Landau’s  equation.  J. 
Nonlinear  Science,  1  (1991)  397-422. 


Address: 

Alexander  Mielke,  Guido  Schneider 
Institut  fur  Angewandte  Mathematik 
Universitdt  Hannover 
Welfengarten  1 
30167  Hannover 
GERMANY 

E-Mail:  mielke@ifam.uni-hannover,de 


Section  IV 


Fluid  Mechanics 
and  Turbulence 


Lectures  in  Applied  Mathematics 
Volume  31,  1996 


Navier- Stokes  Equations  and  Incompressible 
Fluid  Turbulence 

PETER  CONSTANTIN 


1.  Introduction 

In  this  talk  I  will  discuss  informally  mathematical  questions  related  to  turbu¬ 
lence.  The  equations  of  motion  of  incompressible  fluids  are  described  as  follows: 
Let  X  e  denote  a  point  in  space,  n  =  2  or  3  and  let  t  >  0  denote  time.  One 
associates  to  the  velocity  u{x^t)  £  a  first  order  differential  operator,  Dt: 

Dt  =  dt  +  u{x,  t)-V. 


Dt  is  the  so-called  material  derivative;  its  characteristics  -  solutions  of  the  ODE 

f=uiX,t) 

are  called  particle  trajectories.  The  Navier-Stokes  equations  are 


DtU  +  Vp  =  i/Aw  -t-  /, 

V  •  =  0. 

The  number  >  0  is  the  kinematic  viscosity  of  the  fluid.  If  one  sets  u  =  0  in 
the  equation  above  one  obtains  the  Euler  equations  of  ideal  fluids.  The  scalar 
function  p{x^  t)  represents  pressure;  its  mathematical  role  is  to  maintain  the  con¬ 
straint  of  incompressibility  V  •  tt  =  0.  The  functions  /  represent  body  forces. 
The  fluid  occupies  a  region  G  C  R^  and  appropriate  boundary  conditions  are 
prescribed.  A  great  variety  of  physical  situations  are  described  by  the  nature 
of  body  forces  and  boundary  conditions.  Two  examples,  Taylor- Couette  and 
Rayleigh-Benard  turbulence  can  serve  as  a  guide  to  posing  questions  regarding 
turbulence.  In  the  Taylor  -  Couette  setting  fluid  is  placed  in  the  space  between 
two  concentric  vertical  cylinders  and  one  of  the  cylinders  is  rotated,  entraining 
the  fluid.  In  the  Rayleigh  -  Benard  setting  fluid  is  placed  in  a  closed  container 
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and  heated  from  below.  The  external  conditions  are  encoded  in  non-dimensional 
parameters:  Reynolds  number,  Rayleigh  number.  They  represent  a  measure  of 
the  strength  of  the  externally  supplied  energy  (determined  for  instance  by  how 
fast  the  cylinders  are  rotated  or  how  much  the  fluid  is  heated).  The  term  non- 
dimensional  refers  to  scale  invariance.  The  experimenters  prepare  an  experiment 
at  given  values  of  the  control  parameters.  The  experiment  ([1],  [2])  is  allowed 
to  run  for  a  sufficiently  long  time  in  order  to  make  sure  that  one  registers  the 
time  asymptotic  regime.  The  information  (one  point  measurements  or  rarely 
two  point  measurements)  is  recorded  and  processed,  taking  time  averages.  New 
values  of  the  control  parameters  are  selected  and  the  process  is  repeated.  As 
the  non-dimensional  control  parameters  are  increased  the  behavior  of  the  fluid 
changes,  very  roughly  speaking  from  simple  to  complex.  At  the  high  end  of 
the  parameter  scale  we  find  incompressible  turbulence.  The  appropriate  math¬ 
ematical  turbulence  problem  is  thus:  study  the  long  time  behavior  of  solutions 
to  Navier-Stokes  equations  at  a  fixed  Reynolds  number  (fixed  i/  >  0,  boundary 
conditions  and  body  forces  /).  Record  it  using  appropriate  measures  or  averages. 
Then  increase  the  Reynolds  number  (for  instance,  decrease  viscosity,  keeping  all 
other  conditions  the  same)  and  study  the  large  Reynolds  number  limit. 

2.  Dynamical  Systems  Paradigm 

The  finite  dimensional  dynamical  system  paradigm  ([3])  has  predictions  ([4]) 
regarding  the  transition  to  turbulence  that  have  been  verified  experimentally. 
Consider  the  case  of  two  dimensional,  spatially  periodic  Navier-Stokes  equations 
with  time  independent  forces.  At  a  fixed  Reynolds  number  the  Navier-Stokes 
equations  are  solved  by  a  nonlinear  semigroup  S{t)  in  a  Hilbert  space  H,  the 
space  of  divergence-free  square  integrable  velocities.  The  norm  |  ■  is  the  square 
root  of  the  total  kinetic  energy  and  S{t)uQ  represents  the  velocity  at  time  t.  The 
semigroup  is  dissipative:  there  exists  a  compact  set  in  H  such  that  all  trajectories 
S{t)uo  belong  to  it  for  large  enough  t.  The  semigroup  has  a  global  attractor  A: 
a  compact  invariant  (5'(^)(A)  =  A)  set  that  contains  all  omega  limit  sets.  All 
Borel  measures  which  are  invariant  under  S{t)  are  supported  in  A.  This  set  has 
finite  fractal  dimension  ([5]).  The  dimension  is  related  to  the  Reynolds  number 
([6])  as  predicted  by  the  argument  of  Landau  ([7])  concerning  the  number  of 
degrees  of  freedom  of  turbulence.  The  existence  of  a  finite  dimensional  attractor 
and  the  time  analyticity  of  solutions  are  used  in  results  that  are  valid  beyond 
the  transition  to  turbulence.  Here  are  two  examples. 

Consider  the  set  of  global  solutions 

g  =  {uq  €  H;  S{t)uQ  extends  to  i  G  R} . 

The  global  attractor  can  be  described  by 

A  =  {uq  G  G]  S{t)uo  is  bounded  fort  G  R} . 
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Consider  now  the  set  of  those  global  solutions  that  grow  at  most  exponentially 
backward  in  time 


uq  e  lim  sup 

t—^—oo 


log|5(t)uo|i/ 


One  can  prove  ([8])  that  A4  is  weakly  dense  in  ff.  The  dynamics  on  can  be 

related  to  the  inviscid  Eulerian  dynamics:  rescaling  appropriately  velocity  and 
time,  one  obtains  the  Euler  equations  as  the  infinite  negative  time  limit.  Thus, 
for  velocities  selected  from  the  rich  invariant  set  \vA,  quasi-Eulerian  dynamics 
describe  the  past  and  finite  dimensional  dynamics  on  the  Navier-Stokes  attractor 
Aj  the  future. 

One  of  the  most  common  ways  to  analyze  temporal  data,  such  as  the  tem¬ 
perature  0{t)  recorded  at  a  given  location  in  the  Rayleigh-Benard  convection 
experiment  is  to  compute  the  power  spectrum, 


The  mathematical  justification  of  this  operation  is  based  on  classical  work  on 
stochastic  signals,  going  back  to  Wiener,  Khintchin  and  Kolmogorov.  At  finite 
Rayleigh  and  Reynolds  numbers  though,  the  signal  is  not  a  random  process, 
rather  it  is  the  output  of  a  finite  dimensional  dynamical  system.  Using  the  prop¬ 
erties  of  solutions  of  Navier-Stokes  equations  one  can  still  make  mathematical 
sense  of  the  procedure.  Moreover  one  can  prove  that  the  power  spectrum  must 
decay  at  least  exponentially  at  high  frequencies  ([9]).  Specifically,  the  power 
spectrum  of  the  temporal  data  obtained  by  evaluating  a  solution  at  some  fixed 
location  is  a  well  defined  positive  Borel  measure  P{duj)  and  satisfies 


P{djj)  <  oo. 


The  positive  constant  r  =  r  (A)  depends  only  on  the  attractor,  in  other  words 
only  on  the  control  parameters. 

The  dynamical  systems  paradigm  is  applicable,  under  the  assumption  of  reg¬ 
ularity,  to  the  three  dimensional  Navier-Stokes  equations  in  a  bounded  do¬ 
main,  at  fixed  Reynolds  number.  Even  if  one  is  willing  to  accept  this  as¬ 
sumption  (a  reasonable  position,  because  finite  time  singularities  in  the  Navier- 
Stokes  equations  are  not  physical),  the  finite  dimensional  dynamical  system 
picture  is  incomplete:  there  are  no  (known)  inertial  manifolds.  A  finite  di¬ 
mensional  Lipschitz  manifold  X  in  the  phase  space  H  is  an  inertial  manifold 
if  it  is  forward  invariant  C  X)  and  attracts  exponentially  all  trajecto¬ 

ries,  distH{S{t)uo,X)  <  C ex.p{—kjt).  Such  manifolds  are  known  to  exist  for 
dissipative  systems  other  than  the  Navier-Stokes  one  ([10],  [11]),  but  not  for 
Navier-Stokes. 
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3.  Dissipation 

One  of  the  most  common,  imporant  and  accessible  effects  of  turbulence  is  bulk 
dissipation.  Torque  in  Taylor- Couette  flow,  drag  in  flow  past  an  obstacle,  and 
heat  transfer  in  Rayleigh-Benard  convection  are  examples  of  bulk  dissipation 
quantities.  These  are  one-point  quantities  (that  is,  numbers,  not  functions) 
that  depend  on  the  key  parameters  (Reynolds  number,  Rayleigh  number)  in  a 
reproducible  manner:  they  obey  empirical  laws  obtained  in  physical  experiments 
([1],  [2]). 

I  will  describe  a  method  ([12],  [13],  [14])  to  estimate  rigorously  such  quantities 
directly  from  the  equations  of  motion.  I  will  use  a  simplified  Taylor- Couette 
setting  to  illustrate  this  method.  Let  us  consider  the  incompressible  Navier- 
Stokes  equations 


(1) 


dfU  +  u  •  Vu  -h  Vp  =  Au 


V  'U  =  0 

in  a  2-D  strip,  with  boundary  conditions 

u{x-\-i,y,t)  =  u{x,y,t) 
u{x,0,t)=0;  u{x,l,t)~Ilex 

The  Reynolds  number  Re  is  a  large  positive  number,  x  is  the  direction  of  the  a; 
axis  and  i>  0  represents  the  aspect  ratio.  Consider 

T 

<  \\uf  >=\imsupT^^^ 
where  ||u|p  —  D(it)  is  the  bulk  dissipation: 

||w||^  =  /  f  I Vtip  dxdy. 

Jo  Jo 

The  problem  is  to  estimate  <  ||?/||^  >  on  solutions  as  a  function  of  Re  as 
Re  -4  cxD. 

The  inhomogeneous  boundary  conditions  together  with  the  finiteness  of  the  ki¬ 
netic  energy  and  of  the  dissipation  determine  an  affine  set  U  in  function  space. 
We  consider  any  time  independent  function  b  eU  { “the  background” )  and  asso¬ 
ciate  to  it  a  corresponding  linear  operator  £5.  In  the  Taylor- Couette  case  this 
operator  is  computed  as  follows: 

Cb  =  A-\-  2Sb 


where 


((V6)  +  (V6)*) 
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P  is  the  Leray-Hodge  projector  on  divergence-free  vectors  in  and  A  the  Stokes 
operator  P(-A).  Let  A(b)  denote  the  bottom  of  the  spectrum  of  £5  in 

Theorem  1.  Assume  that  h  eU  satisfies  P{b  •  V6)  =  0  and  X{b)  >  0.  Then, 
every  solution  u{t)  in  lA  satisfies 

<lhWf  ><  iwp. 

The  interpretation  of  this  result  is  the  following.  Take  any  steady  solution 
of  the  inviscid  (Euler)  equation  P  (6  ■  V6)  =  0  with  the  correct  boundary  con¬ 
ditions  {b  E.  U)  and  compute  its  quadratic  form  stability  as  if  it  were  a  steady 
solution  of  the  Navier-Stokes  equation  with  half  the  given  viscosity  (|  in  our  non- 
dimensional  setting;  this  is  related  to  the  fact  that  the  nonlinearity  is  quadratic). 
If  the  solution  is  stable  (A (6)  >  0)  then  its  bulk  dissipation  is  an  upper  bound 
for  the  long  time  average  bulk  dissipation  of  any  solution  of  the  viscous  problem. 

Examples  of  such  functions  are  obtained  by  choosing  flat  shear  flow  back¬ 
grounds  with  sharp  boundary  layers.  Choosing  the  size  of  the  boundary  layer 
of  the  order  of  Re"^  ensures  that  X{b)  >  0.  The  requirement  that  the  back¬ 
ground  must  satisfy  the  steady  Euler  equations  can  be  dropped;  then  the  bulk 
dissipation  is  bounded  by  a  more  complicated  expression  ([13]). 

Using  Theorem  1  one  can  prove  that 

<\\u{t)f  ><  CBje\ 

Moreover,  C  can  be  estimated  explicitly  and  the  results  agree  with  the  phys¬ 
ical  experiment  ([2],  [12],  [13],  [14]).  In  order  to  improve  the  estimate  one  is 
naturally  lead  to  variational  problems  with  spectral  side  conditions.  A  set  of 
backgrounds  A  is  chosen,  such  that  ifbE  A  then  P(6  •  V6)  =  0.  The  set  of  shear 
backgrounds 

is  a  natural  and  simple  example.  In  A  one  considers  the  subset 

Cx  =  {bEA-X{b)  >0}. 

The  minimization  problem  is  to  compute 

This  problem  leads  to  new,  nonlinear  Orr-Sommerfeld-like  equations  ([13]).  Bi¬ 
furcations  in  the  boundary  of  the  sets  Cx  as  the  control  parameters  are  varied 
can  be  responsible  for  sudden  transitions  in  the  scaling  law  of  the  bulk  quan¬ 
tity.  This  can  happen  as  a  consequence  of  the  finite  size  (discrete  spectrum) 
of  the  domain.  If  the  condition  of  divergence-free  is  dropped,  the  problem  sim¬ 
plifies  considerably.  Then  there  are  no  sudden  transitions  and  the  constrained 
Euler-Lagrange  equations  lead  to  a  nonlinear  eigenvalue  problem  for  a  cubic 
steady  one  dimensional  Schrodinger  equation.  This  can  be  solved  explicitly,  at 
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each  Reynolds  number  and  the  asymptotic  behavior,  as  the  Reynolds  number  is 
increased  can  be  computed  explicitly.  The  outcome  is  an  improvement  in  the 
prefactor  C  but  the  power  law  remains  the  same.  This  law  is  consistent  with  the 
Kolmogorov  dissipation  law  (see  next  section)  which,  however  was  not  envisioned 
to  apply  in  such  boundary-dominated  situations.  The  fundamental  question  re¬ 
mains:  does  the  presence  of  boundaries  alter  the  Kolmogorov  dissipation  law?  In 
particular,  does  the  logarithmic  wall  friction  law  rigorously  represent  the  infinite 
Reynolds  number  asymptotics?  All  our  preliminary  results,  including  in  addition 
to  Taylor-Couette  turbulence,  Rayleigh-Benard  convection  and  channel  flow  are 
consistent  with  the  asymptotic  predictions  of  the  Kolmogorov  dissipation  law . 

4.  Stochastic  PDE  Paradigm 

The  idea  to  describe  turbulence  in  terms  of  statistical  rather  than  deteministic 
solutions  is  not  new  [15].  Contemporary  theories  ([16],  [17])  use  tools  of  statis¬ 
tical  physics,  but  the  contact  with  the  Navier-Stokes  equations  is  not  complete. 
The  mathematical  existence  theory  for  statistical  solutions  of  the  Navier-Stokes 
equations  ([18],  [19],  [20])  is  developed  to  a  similar  extent  as  is  the  existence 
theory  for  deterministic  solutions.  The  temporal  power  spectrum  of  wind  tun¬ 
nel  data  provides  strong  experimental  verification  of  one  of  the  beacons  in  the 
subject  of  turbulence,  the  Kolmogorov  theory  ([21]).  Grounded  in  dimensional 
analysis,  this  theory  proposes  the  existence  of  universal  scaling  behavior  in  fully 
developed  turbulence.  According  to  the  Kolmogorov  dissipation  law,  the  energy 
dissipation  rate 

e  =  v{\Vu(x,t)\^) 

is  a  positive  constant  that  is  bounded  independently  of  viscosity.  The  braces 
<  •*•>  represent  ensemble  average  (functional  integration).  Kolmogorov 
assumed  homogeneity  and  isotropy  (invariance  with  respect  to  translations  and 
rotations  of  the  underlying  probability  distributions).  In  addition,  the  Kolmo¬ 
gorov  theory  assumes  that  there  exists  an  interval,  the  so-called  inertial  range, 
which  extends  from  a  small  length  (the  Kolmogorov  dissipation  scale  r/),  to  a 
large  one  (the  integral  scale  p)  such  that,  for  r  €  [q,p], the  variation 

s{r)  =  (|u(x  +  t/)  -  u(x)p)^ 

of  velocity  across  a  distance  r  —  \y  \  can  be  determined  from  dimensional  analysis: 

s(r)  ~  (er)^  . 

This  is  the  Kolmogorov  two-thirds  law  (two-thirds  because  one  has  5^^(er)3). 
It  is  one  of  the  most  important  predictions  of  this  theory.  It  implies  that 
the  energy  spectrum  behaves  approximately  like  a  —  |  power  in  a  range  of  wave 
numbers.  There  exists  experimental  numerical  and  theoretical  evidence  that 
seems  to  indicate  the  possibility  of  small  departures  from  simple  scaling.  More 
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precisely,  the  equal  time  generalized  structure  functions  (see  below)  scale  with 
distance  like  powers 

<  \u{x  +  y,t)  -  u{x,t)\^  >^rs^  U 

where  the  value  of  the  exponents  is  close  to  |  but  depends  on  m.  The  Cm 
must  be  nonincreasing  in  m  because  of  the  Holder  inequality. 

I  will  describe  here  the  results  of  ([14],  [22])  regarding  the  scaling  of  velocity 
structure  functions  for  the  Navier-Stokes  equations.  We  consider  ensembles  of 
solutions  of  the  Navier-Stokes  equations  in  the  whole  3-D  space.  We  assume  that 
there  exist  uniform  bounds  for  the  velocities  in  the  ensembles 

sup  |w(a:,  i)|  <  C/  (*) 

x,t 

This  assumption  implies  regularity  of  the  solutions.  We  will  consider  driving 
body  forces  /  which  are  bounded  uniformly. 

sup  \f{x,t)\  <  B  (**) 

x,t 

The  forces  are  deterministic.  We  assume  also: 

where  is  defined  below. 

These  are  the  standing  assumptions.  There  are  no  assumptions  of  homogene¬ 
ity  or  isotropy. 

We  define  the  averaging  procedure  Mp  by 

13  f 

Mp{h{x,t))  =  AY  sup  lim  sup  -y- — ^  /  /  h{x,t)dxdt 

xoeR\  T->oo  T  (4V)  Jq  Jb,{xo) 

AY  means  ensemble  average  and  sup^^  is  a  supremum  over  all  Euclidean  balls 
Bp  with  center  xq  and  radius  p. 

We  set 


and 


We  denote 


If 


e(p)  =z/M£  (|VM(a:,f)p), 


^mHv)  =  lMp(\u{x  +  y,t)  -u[x,t)\^)]^  . 
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holds  on  an  interval 


r 

then  we  say  that  we  have  m-scaling.  This  definition  implies  that  the  local 
Reynolds  number  ranges  between  c  and  CRe  in  the  scaling  region. 

Here  are  the  main  results. 

Theorem  2.  Assume  Then 


Hp)  ^ 


C/2 


Note  that  the  dissipation  is  bounded  uniformly  as  u  0.  Thus,  the  Kol¬ 
mogorov  dissipation  law  is  true  as  an  upper  bound.  A  lower  bound  is  much 
more  difficult  to  obtain;  as  far  as  I  know  it  is  still  an  open  problem. 

We  show  that  structure  functions  for  the  pressure  are  bounded  in  terms  of 
structure  functions  for  the  velocity.  Once  the  pressure  is  controlled  then  the  re¬ 
sult  follows  via  local  energy  inequalities.  We  relate  the  second  structure  function 
to  the  first: 

Theorems.  (*),(**),(**  *).  Then 

s[^\y)  <  cl7^Re5 


In  particular,  if  1  and  2-scaling  occur  then 


This  is  seen  as  a  result  deduced  in  the  neighborhood  of  the  dissipation  scale. 
Without  the  postulate  that  scaling  persists  down  to  lengths  where  the  corre¬ 
sponding  local  Reynolds  number  is  order  one  we  can  not  assert  a  direct  relation 
between  and  C2' 

It  is  widely  believed  that  <3  =  1*  The  evidence  is  both  numerical  and  the¬ 
oretical.  The  traditional  theoretical  arguments  are  based  on  the  assumption  of 
homogeneity  and  isotropy  ([15]).  A  conjecture  of  Onsager  corresponds  to  the 
statement  that  Cs  >  |  implies  6  =  0;  a  mathematical  formulation  and  a  proof  of 
this  conjecture  ([23])  offer  additional  arguments  in  support  of  (^3  =  ^.  Here  is  a 
brief  description  of  the  result  in  ([23]).  Denote  first 

We  =  y  0e(a;  -  y)u{y)dy 
a  standard  mollification  of  u.  Then  denote  by 

r^(w,w)(x)=  f  4>e(y){^yu{x)®6yu{x))dy 


where 


6yu{x)  =  u{x  -y)-  u(x). 
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The  following  identity  holds  pointwise: 

{U  (8>  =  lie  0  tig  +  u)  —  {u  —  tig)  (g)  (ti  —  tig). 

Theorem  4.  If  u  is  a  weak  solution  of  Euler  ^s  equation  which  belongs  to  the 
space  L^((0,  T);  H3 ’^)  for  some  o:  >  ^  then 

forQ<t<T. 

The  proof  uses  the  well-known  facts  about  Besov  spaces: 

||w-«£||l3  < 

ll^yw||L3  <  C\yn\uh^.^, 

and 

||Vn,|U3  < 

It  follows  that 

lln(M,w)ll^j  <  Ce^“||u||g«.«.. 

The  proof  follows  from  the  identity  and  from 

li  d  f 

=y  Tr((w(8iw),(Vue))rfa; 

by  the  ususal  incompressible  cancellation  and  by  straightforward  Holder  inequal¬ 
ities.  One  has  to  mollify  in  time  but  that  is  not  a  problem. 

Corrections  to  the  Kolmogorov- Obukhov  exponent  of  ^  ^  are  referred  to 

by  the  name  of  intermittency.  They  are  believed  to  be  connected  to  the  exis¬ 
tence  of  statistically  significant  large  variations  in  the  velocity  gradients,  over 
small  regions  in  physical  space.  We  have  some  mathematical  evidence  for  this 
connection: 

Theorem  5.  Assume  (*),(**),(***),  and  4~scaling.  If 

vME{\Vu{x-\-y,t)  —  Vu{x^t)\^)  >  c€(£)  >  0 

for  some  y  satisfying 

<  C(£e)-^ 

P 


then 
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Thus,  if  the  gradient  can  be  decorrelated  by  translations  over  very  small  dis¬ 
tances  then  intermittency  sets  in.  Note  that  |  is  the  value  of  the  exponent 
of  the  Kolmogorov  length.  If  ^3  =  |  and  if  ("4  makes  sense,  then  it  must  be 
equal  or  less  than  If  /?  =  |  then  our  result  is  consistent  with  C4  =  I?  he?  with 
the  K41  theory,  li  (3  >  \  however,  then  the  preceding  result  gives  sufficient  and 
testable  conditions  for  intermittency  corrections. 


5.  The  inviscid  limit 

Here  we  consider  the  inviscid  limit,  that  is  we  fix  time  and  let  the  control 
parameter  tend  to  infinity.  Let  us  focus  on  the  Navier- Stokes  equation.  The  first 
question  is:  what  are  the  limiting  equations?  In  realistic  closed  systems  where 
the  boundary  effects  are  important,  unstable  boundary  layers  drive  the  system: 
the  limit  is  not  well  understood.  In  the  case  of  no  boundaries  (periodic  solutions 
or  solutions  decaying  at  infinity)  the  issue  becomes  one  of  smoothness  and  rates 
of  convergence.  Indeed  in  n  =  2,  if  the  initial  data  are  very  smooth  then  the 
limit  is  the  Euler  equation  and  the  difference  between  Navier-Stokes  solutions  and 
corresponding  Euler  solutions  is  optimally  small  However,  if  the  initial 

data  are  not  that  smooth,  for  instance  in  the  case  of  vortex  patches,  then  the 
situation  changes.  Vortex  patches  are  solutions  whose  vorticity  (antisymmetric 
part  of  the  gradient)  is  a  step  function.  They  are  the  building  blocks  for  the  phase 
space  of  an  important  statistical  theory  ([24],  [25]).  When  one  leaves  the  realm 
of  smooth  initial  data  the  inviscid  limit  becomes  more  complicated:  internal 
transition  layers  form  because  the  smoothing  effect  present  in  the  Navier-Stokes 
solution  is  absent  in  the  Eulerian  solution.  In  the  case  of  vortex  patches  with 
smooth  boundaries,  the  inviscid  limit  is  still  the  Euler  equations,  but  there  is  a 
definite  price  to  pay  for  rougher  data:  the  difference  between  solutions  (in  L^) 
is  only  0{^/u)  ([26]).  This  drop  in  rate  of  convergence  actually  occurs  -  there 
exist  exact  solutions  providing  lower  bounds.  The  question  of  the  inviscid  limit 
for  the  whole  phase  space  of  the  statistical  theory  of  [24]  and  [25]  is  open.  If  the 
initial  data  are  more  singular  then  even  the  classical  notion  of  weak  solutions 
for  the  Euler  equations  might  need  revision  ([27])  except  when  the  vorticity  is 
of  one  sign  ([28]). 

In  the  case  of  three  spatial  dimensions  and  smooth  initial  data,  the  inviscid 
limit  is  the  Euler  equation  as  long  as  the  corresponding  solution  to  the  Euler 
equation  is  smooth  ([29], [30])  This  might  be  a  true  limitation,  because  of  the 
possibility  of  finite  time  blow-up,  The  blow-up  problem  for  the  Euler  {u  —  0) 
equations  is  the  following:  do  smooth  data  (for  instance  /  =  0,  and  smooth, 
rapidly  decaying  initial  velocity)  guarantee  smooth  solutions  for  all  time?  The 
answer  is  known  to  be  yes  only  for  n  =  2,  not  known  for  n  —  3. 

The  three  dimensional  incompressible  Euler  equations  are  equivalent  to  the 
requirement  that  two  vector  fields  Q  and  Dt  commute: 
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[A,O]  =  0. 

Both  vector  fields  are  associated  to  divergence  free  vectors.  The  first  one,  Dt  is 
associated  to  the  velocity  field  u  =  u{x^t): 


Dt=dt+  u{x,  t)  •  V 


The  second  vector  field,  is  associated  to  the  vorticity  u  =  uj{x,  t): 

0  =  uj{x,t)  •  V. 

The  integral  curves  of  O  are  vortex  lines.  The  commutation  relation  is  equivalent 
to  the  stretching  equation 

DfLj  =  (jj  •  Vu. 

Many  incompressible  inviscid  hydrodynamical  models  are  obtained  from  the 
commutation  relation  by  specifying  a  coupling  relation  between  the  coefficients 
u  and  uj  of  Dt  and  0.  In  many  cases  this  coupling  has  the  form 

u  = 

The  3D  Euler  coupling  kernel  JC  is  given  by  the  Biot-Savart  law 

i^(Euler)/  N  Vk 

Rj 

where  V  —  Rk  is  the  signature  of  the  permutation  (1,2,3)  j— >  and 

repeated  indices  are  summed.  Note  that  the  singularity  at  the  origin  is  of  the 
order  2  =  n  —  1.  A  natural  analogue  of  the  three  dimensional  Euler  coupling  in 
two  dimensions  is  ([14],  [33],  [32]) 

^(QGASE)  /  s  _  .  J_ 

Rj  {y}  ~  otj  1^1 

This  defines  the  quasi-geostrophic  active  scalar  equation  (QGASE) .  Its  coupling 
kernel  has  the  analogous  singularity  strength  1  =  n—1.  The  QGASE  is  physically 
significant  in  its  own  right:  it  is  a  model  for  temperature  in  a  quasi-geostrophic 
(Coriolis  forces  balance  pressure  gradients)  approximation  of  atmospheric  flow. 
We  will  discuss  simultaneously  the  3D  Euler  equations  and  QGASE.  The  nota¬ 
tion  is  purposely  ambivalent.  Because  the  QGASE  systems  are  2D  they  can  be 
described  by  the  time  evolution  of  scalars,  functions  0{x^t)  that  obey 

(^t  -h  w  •  V)  ^  =  0 

where  u  is  obtained  from  u  as  above  -and  0  is  related  to  cu  by 

a;  =  V-^<9. 

(V-^  denotes  the  gradient  rotated  counter-clockwise  by  90  degrees).  The  integral 
lines  of  fl  are  material  i.e.,  they  are  carried  by  the  flow.  The  QGASE  analogues 
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of  vortex  lines  are  iso-^  lines.  Their  length  element  is  |a;|.  They  stretch  according 
to 

Dt\co\  -  a\u;\. 

It  is  well-known  ([31]  for  the  3D  incompressible  Euler  equations,  but  the  result 
holds  for  QGASE  as  well)  that 

is  a  necessary  and  sufficient  condition  for  blow-up. 

It  is  also  easy  to  prove  that  the  singularities  cannot  occur  without  small  scales 
developing  ([14])  in  a;.  By  that  I  mean  that  that  large  spatial  gradients  of  its 
magnitude  must  develop,  at  a  fast  enough  rate.  This  is  related,  perhaps,  to 
intermittency  (see  previous  section). 

The  stretching  rate  alpha  is  given  by 


a{x)  =  {Vu{x))^{x)-^{x) 
and  the  direction  field  ^  by 


f(x)  = 


w(a:) 


kwr 

The  region  {x  ;  lc(;(ir)|  >  0}  is  material.  Both  a  and  ^  are  defined  in  it.  The 
stretching  rate  a  has  a  remarkable  integral  representation 

a(a:)  =P.  V.  j  D  {y,i{x  +  y),^{x))\u{x  +  y)\^. 

Here  n  is  the  dimension  of  space  (3  for  Euler  equations,  2  for  QGASE)  and 
■D(Euler)  (y,^(a:  +  2/),|(a;))  =  ((y  •^(x))Det(t/,f(a;  +  j/),^(x))) 


and 

(y,^{x  +  y),i{x))  =  ((^■^■^(a:;))(?(a;  +  2/)-^-^(x))). 

Note  that  the  geometric  factors  D  vanish  not  only  in  the  spherical  average,  but 
also  if  the  vectors  Ox -\-y)  and  ^(a:)  are  parallel  or  anti-parallel.  More  precisely, 
if  cos  (j)  —  ^{x)'  ^{x  +  y)  then  \D\  <  |  sin(/)|.  The  geometric  factors  vanish  also  if 
y  is  parallel  to  ^(a;). 

Based  on  these  properties  we  prove  ([32],  [36])  that  if  the  direction  field  ^  is 
smooth  in  regions  of  high  |a;|  then  blow  up  does  not  occur. 

Let  u  be  the  velocity  of  either  a  three  dimensional  incompressible  Euler  so¬ 
lution  with  smooth  and  localized  initial  data  or  of  a  two  dimensional  QGASE 
solution  with  smooth  and  localized  initial  data.  Assume  that  the  corresponding 
solution  is  smooth  on  a  time  interval  0  <t  <T.  The  velocity  field  u  —  u{x,  t)  is 
used  to  define  particle  trajectories  X{q^t)^  solutions  of 

dX 
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X(g,0)  =  g. 


If  Wo  is  a  set  we  denote  by  Wt  its  image  at  time  t 


Wt  =  X{t,Wo) 

Br(yV)  denotes  the  neighborhood  of  W  formed  with  points  situated  at  Euclidean 
distance  not  larger  than  r  from  W.  We  say  that  a  set  Wq  is  smoothly  directed  if 
there  exists  p  >  0  and  r,  0  <  r  <  |  such  that  the  following  three  conditions  are 
satisfied.  First,  for  every  q  6  Wq  , 


WS  =  {qeWo;\u;oiq)\^0}, 


and  all  t  G  [0,  T),  the  function  ^{-jt)  has  a  Lipschitz  extension  (denoted  by  the 
same  letter)  to  the  Euclidean  ball  of  radius  4p  centered  at  X(q,  t)  and 


M  = 


lim  sup 
qew, 


0  *^0 


00. 


Secondly, 

sup  \u){x,t)\<m  sup  \(jj{Xjt)\ 

BsAWt)  BAWt) 

holds  for  all  t  E  [0,  T)  with  m  >  0  constant.  And  finally. 


sup  |w(x,t)|<C/ 

B4p(Wt) 

holds  for  all  t  E  [0,  T). 

The  first  assumption  means  that  the  direction  field  ^  is  well  behaved  in  a 
fixed  neighborhood  of  a  bunch  of  trajectories.  The  second  assumption  states 
that  this  neighborhood  is  large  enough  to  capture  the  local  intensification  of  |a;|. 
In  other  words,  the  competing  and  significantly  stronger  blow  up  trajectories,  if 
they  exist,  do  not  come  near  this  bunch.  Our  regularity  result  is 


Theorem  6.  Assume  Wq  is  smoothly  directed.  Then  there  exists  r  >  0  and 
T  such  that 

sup  \u{xA)\  sup  |a;(a:,to)| 

BriWt)  BAWt^) 

holds  for  any  0  <to  <T  and  0  <t  —  tQ  <t. 


The  numerical  evidence  for  QGASE  ([32])  supports  strongly  two  statements: 
first  that  sharp  fronts  do  form  in  finite  time  (sharp  means  large  gradients  of  9, 
i.e.  large  |a;|;  based  on  the  present  computations,  blow  up  cannot  be  predicted). 
Secondly  that  there  is  a  marked  difference  in  the  rate  of  development  of  these 
sharp  fronts  caused  by  the  nature  of  There  exist  initial  data  for  which  ^ 
develops  only  antiparallel,  regularly  directed  singularities.  The  formation  of 
fronts  is  then  depleted.  For  other  initial  data,  a  saddle  point  in  0  provides  a 
Lipschitz  singularity  in  the  ^  direction  field.  This  is  the  source  of  much  more 
intense  gradient  growth.  Qualitatively  similar  numerical  results  were  obtained 
also  for  the  Euler  equations  ([37]). 
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Now  I  will  address  briefly  the  role  of  viscosity.  In  the  three  dimensional 
incompressible  Navier-Stokes  equations  there  exist  suitable  weak  solutions  (this 
is  a  technical  term)  that  satisfy 

<  |a;(a;,<)||Vf(a:,f)|^  >  <  ^ 

where  i/  >  0  is  the  viscosity,  T  is  given  in  terms  of  the  initial  data  and  <  •  •  *  >  is 
an  appropriate  space  and  time  average  ([35],  [34]).  Consequently,  typical  regions 
of  high  vorticity  have  Lipschitz 

Moreover,  if  one  assumes 

Assumption  (A) 

There  exist  constants  Q  >  0  and  p  >  0  such  that 

holds  if  both  \u{x,  ^)|  >  0  and  \uj{x  -h  t/,  i)|  >  0,  and  0  <t  <T. 

(P^-^^)^(a:  +  y)  is  the  projection  oi  ^{x  +  y)  orthogonal  to  ^(x)),  then  ([34]): 

Theorem  7.  Under  the  assumption  (A)  the  solution  of  the  initial  value  prob¬ 
lem  for  the  Navier-Stokes  equation  is  smooth  (C^)  on  the  time  interval  [0,  T]. 

Based  on  these  theoretical  considerations  and  on  the  numerical  evidence  on 
active  scalars  ([32])  and  on  Euler  equations  ([37])  we  speculate  that  blow  up 
occurs  in  the  Euler  equations  and  that  this  event  occurs  in  the  neighborhood 
of  a  singular  object  in  physical  space  (such  as  a  vortex  tube  pair  with  a  conical 
singularity).  We  continue  to  speculate  that  for  the  Navier-Stokes  equation  these 
events  are  not  blow-up  events  but  trigger  topological  change  and  large  dissipation 
events;  their  presence  is  felt  at  the  dissipation  scales  and  is  perhaps  the  source 
of  small  scale  intermittency. 
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Introduction 

The  inertial  scales  of  three-dimensional  fully  developed  turbulence, 
which  mediate  between  the  energy-containing  range  and  the  dissipa¬ 
tion  range,  are  a  main  focus  of  turbulence  theory.  They  exhibit  the 
well-known  Kolmogorov  spectrum,  E{k)  =  constant-es  ,  where 
E{k)  is  the  energy  spectrum  (energy  as  a  function  of  the  wave  num¬ 
ber  k)  and  e  is  the  rate  of  energy  dissipation  per  unit  mass.  Most 
derivations  of  this  spectrum  rely  on  the  idea  of  an  energy  cascade: 
energy  is  fed  into  the  system  at  large  scales,  it  is  dissipated  at  small 
scales,  and  in  the  inertial  range  the  energy  flow  is  not  only  irreversible 
but  also  far  from  a  Gibbsian  thermal  equilibrium.  Thermal  equilib¬ 
rium  for  the  equation  of  fluid  mechanics  is,  on  the  other  hand,  iden¬ 
tified  with  an  “equipartition”  ensemble,  in  which  the  mean  energy 
per  Fourier  mode  is  constant,  and  the  resulting  spectrum  is  propor¬ 
tional  to  —  a  very  different  spectrum  indeed.  This  general  picture 
has  survived  in  the  literature  (see  e.g.  [28])  even  though  the  idea  of 
a  unidirectional  cascade  contradicts  experiment  [29],  closely  related 
fields  (e.g.  plasma  turbulence  [19],  superfluid  turbulence  [17])  do  not 
exhibit  such  a  chasm  between  equilibrium  and  non-equilibrium  phe¬ 
nomena,  recent  analyses  of  two-dimensional  turbulence  make  ample 
use  of  equilibrium  ideas,  and  the  “equipartition”  ensemble  is  of  doubt¬ 
ful  relevance  to  fluid  mechanics.  Indeed,  turbulence,  as  it  is  usually 
described,  is  the  only  major  example  of  an  irreversible  system  with 
many  degrees  of  freedom  which  cannot  be  connected  in  a  significant 
way  to  an  appropriate  equilibrium  process  (e.g.,  via  a  fluctuation- 
dissipation  theorem).  It  has  already  been  argued  in  detail,  e.g.  in 
[10],  that  the  “equipartition”  (or  “Hopf’)  equilibrium  is  an  artifact 
produced  by  divergent  spectral  approximations  and  has  no  relevance 
to  fluid  mechanics,  and  this  argument  will  not  be  repeated  here.  We 
shall  emphasize  instead  the  contrary  point  of  view,  according  to  which 
turbulence  can  indeed  be  usefully  studied  as  a  small-to-moderate  per¬ 
turbation  of  a  Gibbsian  thermal  equilibrium  in  vortex  (or  “magnet”) 
variables.  The  idea  is  sensible  because  it  is  plausible  that  the  small 
scales  of  turbulence  have  intrinsic  time  scales  much  smaller  than  the 
time  scale  of  overall  decay,  and  there  is  enough  time  for  equilibrium 
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to  be  reached.  The  emphasis  here  is  on  the  motivation  and  on  the 
plausibility  of  the  framework  presented  rather  than  on  detailed  cal¬ 
culations,  especially  when  the  latter  are  already  available  in  print. 
Note  that  except  possibly  in  the  next  section,  “equilibrium”  means 
“Gibbsian  equilibrium”  and  not  “statistical  steady  state” . 


Example  of  a  cascade  through  an  equilibrium  spectrum 

In  subsequent  sections  we  shall  present  models  whose  spectrum  is 
determined  by  an  equilbrium  ensemble  and  which  also  exhibit  an  en¬ 
ergy  transfer  between  large  and  small  scales.  The  usual  belief  is  that 
equilibrium  and  turbulent  energy  transfer  are  mutually  exclusive,  un¬ 
less  “equilibrium”  means  “statistical  steady  state”.  In  the  present 
section  we  present  a  simple  example  of  a  system  where  an  irreversible 
energy  cascade  proceeds  across  a  spectrum  whose  form  is  determined 
by  what  can  be  reasonably  viewed  as  “thermal  equilibrium”  considera¬ 
tions,  and  where  in  addition  there  is  a  relation  between  the  amplitude 
of  the  energy  spectrum  and  a  rate  of  energy  dissipation.  More  elab¬ 
orate  examples  of  systems  with  similar  behavior  are  available  in  [12]. 
In  no  way  do  we  imply  that  there  is  never  a  difference  between  sta¬ 
tistical  steady  states  and  Gibbsian  thermal  equilibrium-  only  that  it 
is  possible  for  the  spectral  distribution  of  the  energy  to  be  similar  in 
both  cases. 

Consider  the  unit  square  T)  =  {x,y  |  0  <  x  <  1,  0  <  ?/  <  1}; 
we  are  going  to  implement  on  D  a  discrete  version  of  the  baker’s 
transformation  (see  e.g.  [30]):  x'  =  2x  —  [2x],  y'  =  |(x  +  [2x]), 
where  [x]  denotes  the  largest  integer  <  x.  This  transformation  is  well 
known  to  be  ergodic  and  mixing,  and  thus  not  an  absurd  example  of 
the  growth  in  disorder  typical  of  turbulence. 

To  discretize  the  transformation,  divide  D  into  boxes  Dij  =  {x,  t/  | 
ih  <  X  <  jh  <y  <  {j  +  where  h  ~  n  =  2^  for  some 

integer  and  takes  the  values  0, 1, . . . ,  (n  —  1).  Furthermore, 
consider  the  discrete  function  0  =  (l>ij  =  0  or  1  on  each  Dij. 
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The  discrete  baker’s  transformation  (z,  j)  ->  (z',  j')  is  defined  by 
x  —  ih,  y  =  jh  , 

x'  =  2x-  [2x]  ,  z/'  =  \{y  -h  [2x])  ,  (1) 

i'  =  x'/h,  f  =  y^lh. 

One  can  readily  check  that  this  transformation  maps  two  pairs  (z,  j) 
on  one  (z',jf')  if  z'  is  odd  and  no  pairs  if  z'  is  even.  To  construct 
the  image  (f)^  of  (j)  under  the  transformation,  consider  for  z'  odd  the 
sum  s  of  the  <f)ij  at  the  preimages  of  {i'J'),  and  set  (j)i'j'  =  0  if 
5  =  0,  =  1  if  s  >  1.  Then,  if  s  <  1  set  =  0,  if 

5  =  2  set  =  1-  This  construction  preserves  J  (j)dxdy  (and 

indeed,  J  (j)'^dxdy  for  all  q).  The  use  of  an  integer^  valued  function  (j) 
is  designed  to  eliminate  any  entropy  increase  due  to  smoothing.  It  is 
easily  seen  that,  given  an  initial  (j)  with  few  “holes”,  the  first  few  steps 
of  the  transformation  (1)  approximate  well  the  first  few  steps  of  the 
continuous  baker’s  transformation.  However,  the  sequence  of  discrete 
maps  of  (j>  is  periodic  with  period  2m. 

Start  with  initial  data  (j)  with  few  “holes”,  for  example,  with  a 
function  (p  which  equals  1  in  some  ih  x  £h  square,  £  integer,  £  <  n. 
This  (j)  and  its  discrete  baker’s  transforms  can  be  expanded  in  Fourier 
or,  even  better,  Haar  series.  For  the  first  few  steps,  the  “energy” 
moves  to  smaller  and  smaller  scales,  as  in  simple  models  of  turbulent 
cascades.  In  the  next  m  steps  the  energy  comes  back;  the  time  aver¬ 
age  of  the  energy  per  scale  is  a  constant  independent  of  scale.  One 
can  readily  find  initial  data  for  which  the  energy  starts  out  roughly 
equidistributed  among  scales  and  remains  so.  We  shall  not  bother  to 
define  a  temperature  and  an  entropy  for  this  system  and  shall  iden¬ 
tify  the  “equidistribution”  ensemble  with  a  thermal  equilibrium;  for 
details,  see  [12];  the  energy  spectrum  has  the  form  E{k)  =  AkP^ ,  with 
^  =  0,  and  its  amplitude  A  depends  on  the  mean  amplitude  of  the 
initial  data. 

Now  add  an  irreversible  energy  cascade  to  this  model:  Start  with 
“smooth”  initial  data,  for  example  (pij  I  ior  i,j  <  I,  I  =  [\/0n]^ 
where  (3  —  T,(l>ijln?  <  1  is  the  fraction  of  sites  where  (pij  =  1.  Re¬ 
move  energy  at  small  scales  by  setting  (pij  to  0  whenever  (pij  1  and 
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<l>i±i,j  =  0,  =  0  (i.e.,  whenever  there  is  an  isolated  1).  Feed 

energy  at  large  scales  by  filling  in  the  missing  1  ’s  whenever  the  initial 
data  would  have  been  recovered  if  it  were  not  for  the  removal,  i.e., 
once  every  2m  steps.  Define  a  rate  of  energy  dissipation  e  as  the  ra¬ 
tio  of  the  number  of  I’s  removed  per  q  steps  divided  by  q  (the  limit 
^  >  00  is  reached  in  2m  steps).  It  is  easy  to  see  that  the  form  of  the 
spectrum  is  unchanged,  and  that  for  small  P,  A  =  26  o(/?),  i.e., 

E{k)  =  2ek~'^.  (The  dependence  of  A  on  e  is  more  interesting  near 
/?  =  ^,  but  this  is  not  the  place  to  discuss  it.)  One  can  reverse  the 
usual  argument  and  say  that  here  (and  possibly  in  turbulence  theory) 
there  is  a  relation  between  E  and  e  not  because  energy  dissipation 
creates  the  spectrum  but  conversely,  because  the  more  energy  there 
is  in  the  “equilibrium”  scales,  more  of  it  can  be  dissipated.  Note  that 
the  dissipation  mechanism  described  for  the  baker’s  transformation 
is  in  fact  a  faithful  representation  of  the  effect  of  the  scales  omitted 
by  the  discretization.  The  recurrent  behavior  of  the  system  is  not 
without  parallel  in  vortex  dynamics  [34]. 

Equilibria  of  vortex  filaments 

We  now  consider  thermal  equilibria  of  vortex  filaments  and  their 
relevance  in  turbulence  theory.  The  steps  are 

(i)  analysis  of  the  equilibrium  configurations  of  a  single  vortex 
filament, 

(ii)  analysis  of  a  many-vortex  collection,  including  numerical  re¬ 
sults  and  an  analytical  description  based  on  a  low  fugacity, 
i.e.,  sparse  system  theory  of  Kosterlitz-Thouless  type,  and 

(iii)  a  brief  mention  of  the  modifications  needed  to  accommodate 
an  energy  cascade. 

The  passage  from  a  many-vortex  system  to  a  continuum  is  well  ana¬ 
lyzed  in  other  contexts  (for  a  review,  see  [31])  but  not  in  the  statistical 
context,  and  is  probably  the  weakest  link  in  what  remains  a  specula¬ 
tive  chain.  The  main  points  in  the  chain  are:  there  exist  stable  equi¬ 
libria  that  attract  vortex  systems  embedded  in  a  three-dimensional  in¬ 
compressible  flow;  these  vortex  equilibria  have  Kolmogorov-like  spec¬ 
tra  (and  higher  moments);  appropriate  irreversible  (but  not  one-way) 
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cascades  live  in  the  vicinity  of  these  equilibria. 

First,  a  common  misapprehension  should  be  dispelled:  the  vortices 
and  vortex  filaments  we  are  dealing  with  can  be  used  to  represent 
an  arbitrary  incompressible  flow,  under  wide  conditions  well  studied 
in  numerical  analysis;  they  are  not  the  coherent,  organized  vortices 
one  sees  in  some  turbulent  flows.  A  coherent,  organized  vortex  is  a 
collective  mode  of  this  many-vortex  system.  The  vortex  representa¬ 
tion  is  general  and  requires  no  d  priori  assumptions  about  the  role 
of  organized  structures  in  turbulence.  A  similar  situation  holds  in 
two-dimensional  vortex-based  analyses  (reviewed  e.g.  in  [10], [18]). 


A  single  vortex  filament 


The  construction  in  this  section  is  amply  described  elsewhere,  see 
e.g.  [8], [10].  Consider  a  single  long  vortex  filament,  with  some  small, 
finite,  constant  cross-section.  It  is  immaterial  whether  the  filament  is 
open  (a  topological  cylinder)  or  closed  (a  topological  doughnut).  The 
energy  of  a  compactly-supported  vorticity  field  ^  ==  ^(x)  is  given  by 
the  integral 

where  |x|  is  the  length  of  the  vector  x.  To  specialize  this  integral 
to  the  case  of  a  filament,  suppose  the  filament  center-line  can  be 
approximately  mapped  on  a  connected  set  of  N  bonds  of  a  cubic 
lattice;  then  E  is  approximated  by 

i 


where  is  a  vector  centered  at  the  center  of  the  lattice  bond  occupied 
by  the  z-th  piece  of  the  filament,  parallel  to  that  bond,  pointing  in  the 
same  direction  as  ^  in  the  bond,  with  |ti|  =  circulation  of  the  vortex 
divided  by  \/^,  \i—  j\  is  the  straight-line  distance  between  segments 
i  and  jf,  and  p  is  that  portion  of  the  integral  in  (2)  which  corresponds 
to  x,x'  in  the  same  bond.  The  parameter  p  can  be  interpreted  as 
a  chemical  potential.  The  vortex  filament  has  many  configurations. 
Assign  to  each  the  probability  Z~^  exp{~(3EN),  where  Z  is  the  ap¬ 
propriate  normalization  factor,  /?  =  l/T,  and  T  is  the  temperature. 
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As  in  two  space  dimensions,  T  can  be  positive  or  negative;.  T  <  0  is 
“beyond  infinity”  rather  than  below  0;  negative  temperatures  occur 
whenever  the  maxima  of  the  entropy  and  of  the  energy  fail  to  coin¬ 
cide.  In  the  limit  N  ^  oo,  the  vortex  filament  is  in  one  of  three 
states:  For  /?  >  0  it  is  balled  up  into  a  crumpled  ball,  for  /?  <  0  it 
is  a  straight  line,  and  for  /?  =  0  all  configurations  are  equally  likely 
and  one  has  a  “polymer”  —  a  self-avoiding  equal  probability  random 
walk.  The  heuristics  of  this  situation  are  simple:  /?  >  0  favors  low 
energy  states,  and  low  energy  for  a  vortex  is  obtained  by  folding  it 
and  allowing  the  velocity  fields  induced  by  its  several  pieces  to  cancel; 
the  converse  holds  for  /?  <  0.  In  the  boundary  case  /3  =  0  the  en¬ 
ergy  spectra  has  the  form  E{k)  k~'^ .  For  an  infinitely  thin  vortex 
7  =  D',  the  vector- vector  correlation  exponent  for  a  polymer,  which 
has  been  calculated  in  several  ways;  a  plausible  conjecture  by  Akao 
[1]  yields  —  2  —  where  a  is  the  Flory  exponent  for  a  polymer, 
Q:  =  .59  and  thus  7  =  .3.  (A  related  conjecture  has  been  offered  by 
Shenoy  [6]).  For  a  filament  with  a  finite,  non-uniform  cross-section,  a 
heuristic  analysis  yields  7  =  5/3,  a  Kolmogorov-like  result  (note  that 
this  Kolmogorov-like  result  is  found  at  equilibrium). 


Consider  now  a  smooth  physical  vortex  with  some  finite  cross- 
section.  It  can  be  modeled  as  a  finite  collection  of  segments,  at  a 
negative  /3.  If  the  vortex  is  imbedded  in  a  random  flow  it  will  pre¬ 
sumably  stretch;  if  its  energy  is  conserved  one  can  see  that  \(3\  will 
decrease  until  /?  =  0  is  reached,  assuming  one  can  represent  the  evo¬ 
lution  of  the  vortex  as  a  succession  of  equilibrium  states.  Energy 
conservation  forbids  the  crossing  of  the  boundary  /?  =  0,  which  is 
therefore  attracting.  We  thus  have  a  simple  near-equilibrium  cartoon 
of  the  effect  of  vortex  stretching  and  of  the  reason  for  the  appearance 
of  a  Kolmogorov  spectrum.  In  two  space  dimensions,  by  contrast, 
there  is  no  stretching,  the  temperature  of  a  vortex  system  is  invariant 
(assuming  adiabatic  walls),  and  there  is  no  universal  spectrum.  Note 
that  here,  just  as  in  two  dimensions,  /3  =  0  is  the  boundary  between 
classical  and  non-claissical  (e.g.,  quantum)  hydrodynamics. 
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A  collection  of  vortex  filaments 

We  now  consider  many  filaments.  The  significant  parameters  in 
the  problem  are  /3  =  l/T,  as  before,  and  /i,  the  “chemical  potential”, 
defined  in  equation  (3).  The  equilibria  of  many- vortex  systems  have 
been  studied  numerically  (see  e.g  [2], [11] ,22]),  and  they  exhibit  a  phase 
transition  line  that  is  no  longer  on  the  fi  axis  =  0)  in  the 
plane.  As  before,  smooth  vortex  filaments  live  on  the  small  /?  side  of 
this  line  (which  includes  the  half  plane  /?  <  0,  since  negative  tempera¬ 
tures  are  “hotter”  than  positive  temperatures),  and  vortex  stretching 
draws  them  to  the  transition  line.  The  analysis  of  this  attraction 
provides  an  interesting  interpretation  of  intermittency  [7], [10]. 

To  understand  the  situation  analytically,  we  proceed  via  a  Kosterlitz- 
Thouless  (KT)  “dielectric”  analysis  [23], [24],  which  is  a  twoterm  ex¬ 
pansion  in  powers  of  the  fugacity  y  =  and  applies  to  sparse  (but 

no  longer  one-filament)  systems.  This  methodology  is  ambiguous  [14] 
and  to  simplify  the  presentation  here  we  shall  pick  its  easiest  variant 
[13];  the  KT  analysis  has  the  great  advantage  of  leading  naturally  to 
irreversible  extensions  of  the  equilibrium  theory. 

First,  following  [13], [27], [32],  we  simplify  the  vorticity  field  and  rep¬ 
resent  it  as  a  sparse  collection  of  circular  vortex  loops.  Recent  work, 
in  particular  by  Buttke  [4], [5],  shows  that  any  vorticity  field  can  be 
approximated  by  such  a  union  of  circular  loops  (also  known  as  “mag¬ 
nets”,  or  “elements  of  impulse”  or  “velicity”),  but  the  representation 
is  not  unique  (creating  problems  when  entropy  is  calculated)  and  the 
loops  are  generally  not  independent  (creating  problems  for  the  argu¬ 
ment  that  follows  here).  The  loop  representation  thus  constitutes  a 
substantative  simplification  [11]. 

Assume  now  that  in  some  part  of  the  (/?,  A^)  plane,  to  be  deter¬ 
mined,  large  loops  “polarize”  smaller  loops,  i.e.,  orient  them  in  a  spe¬ 
cific  way,  creating  a  “dielectric”  medium  [21].  In  the  simplest  form 
of  a  Kosterlitz-Thouless  argument,  one  assumes  that  there  exists  a 
dielectric  constant  e  and  that  the  response  of  the  system  of  loops  to 
an  imposed  velocity,  as  well  as  its  energy,  is  reduced  by  e.  (In  a  more 
detailed  theory,  e  is  a  function  of  scale.)  When  /?  >  0,  the  Gibbs  fac¬ 
tor  favors  lower  energies,  and  thus  arranges  smaller  loops  to  oppose 
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the  flow;  then  e  >  1.  It  is  customary  to  work  in  terms  of  the  param¬ 
eter  K  =  p I e.  Prom  general  statistical  mechanical  principles,  one 
can  calculate  the  susceptibility  of  a  vortex  loop  (the  mean  amount  by 
which  it  reduces  an  imposed  velocity  u,  divided  by  |u|),  as  well  as  the 
density  of  loops  of  each  radius  r,  and  by  adding  up  the  contribution 
of  all  loops  one  obtains  for  K  the  integral  equation 

/oo 

r®  exp(— /i))dr  ,  (4) 

where  E  =  E{t^  //)  is  the  energy  of  a  loop  of  radius  r,  which  depends 
on  fi.  6  is  the  thickness  of  a  loop,  assumed  small  but  finite.  The 
integral  equation  is  nonlinear  even  in  this  linear  theory  because  the 
effective  energy  depends  on  K,  The  integral  equation  (4)  defines 
a  function  K  =  iir(/3, /x),  as  well  as  a  set  of  renormalization  group 
transformations  which  do  not  concern  us  here.  The  plane  is 

divided  into  a  region  where  K  is  real  and  positive,  a  region  where  K 
(which  must  be  real)  is  not  defined,  divided  by  a  “transition  curve” . 

A  small  imaginative  leap  identifies  the  region  where  K  is  well  de¬ 
fined  with  the  “folded”  region  of  the  one- vortex  theory.  Indeed,  polar¬ 
ization  and  vortex  folding  are  similar;  one  can  observe  the  reduction 
of  the  effective  velocity  by  vortex  folding  and  bending  in  standard 
vortex  computations.  Similarly,  the  region  where  K  is  ill-defined  is 
the  region  in  which  vortex  lines  are  smooth  and  the  representation  by 
small,  independent  loops  is  inapplicable;  the  boundary  between  them 
is  the  phase  transition  line.  In  superfluid  theory,  the  “folded  region” 
is  the  region  in  which  there  is  a  superflow,  and  the  “unfolded  region” 
is  the  normal  fluid  region. 

To  make  the  theory  concrete,  consider  the  case  of  very  thin  vortex 
filaments,  S  «  1,  The  integral  equation  (4)  reduces  [13]  to 

rOO 

if-i  =  /?-i  +  Cl  y  r®  eM-C2Knr)dr  ,  (5) 

where  Ci,C2  are  known  constants,  and  fx  can  be  calculated  from  (2) 
once  6  is  known.  Following  an  argument  due  to  Hald  [20],  equation 
(5)  can  be  rewritten  in  the  form 

poo 

T  =  P  ^  =  f{K)  ,  f{K)  =  K~^—ci  I  exp{—C2KiJ,r)dr  . 

(6) 
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(Note  that  the  integral  can  be  evaluated  exactly.)  The  function  f{K)^ 
plotted  in  Figure  1  for  ci  =  C2  =  =  1,^  =  0,  is  convex  in  K  and 

has  a  single  maximum,  T^ax,  reached  at  a  point  K  =  Xmax-  For 
T  >  Tmax,  equation  (5)  has  no  real  solution.  The  chemical  potential 
AX  as  a  function  of  /3max  “  1/^max  is  plotted  in  Figure  2.  Note  that 
/?  <  0  is  to  the  right  of  >  0,  being  “hotter”.  This  phase  transition 
line  is  qualitatively  similar  to  the  computed  phase  transition  line, 
see  e.g.  [2], [11], [22].  The  trajectory  of  a  vortex  system  undergoing 
stretching  is  also  marked  in  Figure  2.  One  can  deduce  from  (5)  an 
inertial  range  spectrum  and  an  inertial  exponent,  which  is  not  exact, 
not  surprisingly  in  view  of  the  crudeness  of  the  model. 

A  further  conceptual  difficulty  has  to  be  faced  here:  arguments 
of  the  type  presented  above  are  familiar  in  the  context  of  superfluid 
vortex  systems,  where  vortices  are  generated  by  thermal  fluctuations 
and  can  reconnect  by  quantum  mechanisms.  We  are  applying  these 
arguments  to  classical  systems,  where  vortices  are  generated  by  vor¬ 
tex  stretching  and  reconnect  through  either  singularity  formation  or 
viscous  effects.  In  neither  case  is  it  obvious  that  thermal  equilibrium 
is  reached,  since  potential  barriers  must  be  forded  in  the  former  case 
and  topological  constraints  overcome  in  the  latter.  One  must  assert 
without  proof  a  “generalized  ergodic  principle”  [26] ,  to  the  effect  that 
whatever  the  obstacles,  equilibrium  is  reached  in  a  reasonable  time. 

One  obtains  from  all  this  not  only  a  qualitative  description  of 
equilibrium  vortex  distributions,  but  also  a  recipe  for  simplifying 
this  description.  Suppose  a  homogeneous  vortex  system  near  an  at¬ 
tracting  equilibrium  is  described  (for  example,  on  a  computer)  by 
means  of  a  large  number  of  vortex  loops.  Suppose  one  deletes  all 
loops  below  some  scale  d.  To  compensate  for  this  removal,  one  has 
to  endow  the  system  with  a  dielectric  constant,  e  =  P/K,  where 
^  i^--i  exp{-C2K fxr)dr,  Ci,C2  are  known,  /x  can  be 

found  from  (2),  /?  =  l/T^ax  and  K  =  ATmax  can  be  found  on  the  tran¬ 
sition  curve  by  identifying  the  maximum  of  f[K).  This  is  a  renormal¬ 
ization  of  the  system. 

Note  that  if  /x  — >  oo  the  system  becomes  sparse;  as  one  can  see 
from  Figure  2,  the  transition  curve  is  asymptotic  to  /?  =  0,  and  one 
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recovers  the  one-vortex  description  of  the  previous  section.  At  /?  =  0 
the  Gibbs  factors  are  all  unity,  the  polarizability  of  a  loop  is  zero,  and 
small  loops  can  be  removed  with  no  penalty.  One  recovers  the  hairpin 
removal  of  reference  [9]. 

The  modification  needed  to  handle  inhomogeneous  systems,  as  well 
as  practical  applications  of  these  constructions,  will  be  discussed  in 
[16]. 

Irreversibility  in  vortex  systems 

The  use  of  KT-like  theory  is  not  without  problems,  but  it  does 
provide  an  elegant  transition  to  the  study  of  “nearby”  irreversible 
process,  and  in  particular  energy  cascades.  It  is  quite  easy  to  add  to 
the  KT  formalism  terms  that  describe  a  finite  rate  of  relaixation  to 
equilibrium  for  vortex  loops  of  scale  r  in  the  presence  of  a  varying 
imposed  velocity.  The  “dielectric  function”  acquires  a  delay,  and  as  is 
well  known,  the  imaginary  part  of  the  Fourier  transform  of  the  delay 
function  represents  energy  loss  (the  Kramers-Kronig  formalism,  see 
e.g.  [25]).  As  is  already  known  from  superfluid  calculations  [3], [33], 
the  energy  loss  is  largest  near  the  phase  transition  line,  as  it  should 
be  from  the  discussion  above.  This  remark,  to  be  amplified  elsewhere 
[15],  connects  the  equilibrium  vortex  model  with  the  irreversible  as¬ 
pects  of  turbulence. 

Conclusions 

An  alternate  approach  to  turbulence  theory  has  been  outlined. 
It  can  be  viewed  as  a  generalization  to  three  dimensions  of  two- 
dimensional  vortex  equilibrium  theory  started  by  Onsager  and  so  ably 
developed  since.  It  provides  a  simple  explanation  for  the  universal¬ 
ity  of  the  Kolmogorov  spectrum  and  for  the  origins  of  intermittency. 
The  general  premise  can  be  restated  in  fairly  anodyne  terms:  Since 
some  kind  of  perturbation  theory  seems  to  be  unavoidable  in  turbu¬ 
lence  theory,  one  should  start  with  a  “ground  state”  that  is  as  close  as 
possible  to  the  final  truth.  Vortex  equilibria  near  the  vortex  critical 
line  may  well  offer  the  natural  starting  point,  as  they  already  present 
many  of  the  features  of  the  Kolmogorov  theory. 
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Fig.  2  Phase  transition  line  in  simple  KT  model. 
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HOMOGENIZATION  AND  RENORMALIZATION: 

THE  MATHEMATICS  OF  MULTI-SCALE  RANDOM  MEDIA 
AND  TURBULENT  DIFFUSION 
Marco  Avellcineda^ 

This  lecture  describes  some  mathematiccil  methods  developed  in  recent  years 
to  study  the  macroscopic  behavior  of  heterogeneous  media  and  diffusion  in  turbu¬ 
lent  fluids.  This  area  of  Applied  Mathematics  comprises  a  diversity  of  beautiful 
ideas.  These  include  the  averaging  of  partial  differential  equations,  or  homogeniza¬ 
tion  theory,  probabilistic  techniques  using  Brownian  motion,  and  recursive  scaling 
methods,  known  as  renormalization  group  techniques,  which  originated  in  High- 
Energy  Physics  and  in  Statistical  Physics  for  the  study  of  phase- transit  ions.  My 
goal  is  to  describe  these  ideas  “at  work”  on  a  classical  problem,  the  advection- 
diffusion  of  a  passive  scalar  in  a  multiple-scale,  random  velocity  field,  showcasing 
the  inter-relation  between  these  methods,  as  well  as  their  scope  and  limitations.  The 
applicability  of  homogenization  and  renormalization  methods  extends  well  beyond 
the  topics  of  this  talk  to  many  other  areas  of  Applied  Mathematics. 

I  present  an  outline  of  what  follows.  In  the  first  section,  I  briefly  describe  ho¬ 
mogenization  theory  and  its  application  to  computing  the  effective  conductivity  of 
a  heterogeneous  medium.  The  second  section  deals  with  the  effective  diffusivity  of  a 
passive  scalar  undergoing  ad vect ion- diffusion  in  an  incompressible  flow.  In  Section 
3,  I  discuss  the  method  of  iterated  homogenization  —  a  mathematical  approach  to 
evaluate  iteratively  the  effective  properties  of  multi-scale,  self-similar  media.  The 
rest  of  the  paper  focuses  on  advection-diffusion  exclusively,  concentrating  on  the 
phenomenon  of  anomalous  diffusion  due  to  “infrared- divergent”  velocity  fields.  Af¬ 
ter  formulating  the  renormalization  problem  in  Section  4,  I  present  two  examples 
in  which  renormalization  can  be  carried  out  in  full  mathematical  rigor.  These  ex¬ 
amples  illustrate  the  use  of  scaling  techniques  combined  with  homogenization  in 
the  same  problem.  In  particulcir,  the  mathematical  relation  between  decimation  of 
high-wavenumber  components  and  homogenization  is  analyzed.  Section  7  discusses 
the  renormalization  group.  Finally,  in  Section  8,  I  propose  an  approximate  solution 
to  the  (open)  problem  of  characterizing  the  large-scale  statistics  of  a  passive  scalar 
evolving  in  an  isotropic,  incompressible,  infrared-divergent  velocity  field. 

1.  Homogenization  Theory.  I  begin  by  a  brief  review  of  the  classical  prob¬ 
lem  of  computing  the  effective  conductivity  of  a  composite  material.  To  fix  ideas, 
consider  a  heterogeneous  material  made  from  two  different  conductors  with  con¬ 
ductivities  <T\  and  (72  and  volume  fractions  f\  and  /2,  respectively.  Mathematically, 
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its  “microscopic”  conductivity  is  described  by  the  function 

(7(x)  =  <ri  xi(x)  +  02  X2(x) ,  (1) 

where  Xi(x)  and  X2(x)  are  the  characteristic  functions  of  the  regions  of  space 
occupied  by  each  of  the  two  constituents.  The  partial  differential  equation  satisfied 
by  an  electric  potential  <(>  inside  this  composite  is 

V  .((t(x))  .  V<l>{x))  =0.  (2) 

The  characteristic  composite  dimension,  L,  is  assumed  to  be  large  with  respect 
to  the  “microscale”,  L  Let  us  introduce  the  small  parameter  e  =  IjL,  where 
L  represents  the  dimensions  of  the  composite.  Then,  defimng  the  macroscopic 
coordinate  x'  =  €x,  we  rewrite  (2)  as 

V  .  {a{x'/e))VM^'))  =0,  (3) 

with 

Let  us  also  make  an  auxiliary  assumption:  that  the  conductivity  cr(x)  is  periodic  in 
with  period  1  in  each  direction,  so  that  the  heterogeneous  material  is  “micro¬ 
scopically  periodic”^.  The  macroscopic  behavior  of  the  composite  is  then  described 
by  the  equation  satisfied  by  <l>o{x')  =  which  turns  out  to  be^ 

V  -(ae//  •  V^o(x'))  =0.  (4) 

This  equation  and,  in  pcirticular,  the  value  of  Oe/f^  can  be  derived  using  the 
homogenization  method  or  method  of  multiple- scale  asymptotics  [BLP],  [SA],  [B]. 
Let  us  expand  formally  the  potential  as  a  series 

9!.,(x'))  =  <^o(x')  +  +e^4^^(x',x7£)  +  ...  (5) 

This  is  a  perturbation  series  in  €,  in  which  the  successive  terms  depend  on  the 
“macroscopic”  variable  x*  as  well  as  on  the  “microscopic”  variable  x  =  x’  j e.  To 
calculate  the  unknown  terms  in  the  expansion,  replace  x'/c  by  x  in  (3)  and  (5),  and 
the  operator  V  by  V*.  Then  substitute  (5)  into  (3)  and,  treating  x'  and 

X  as  independent  variables^  expand  equation  (3)  in  powers  of  €.  This  results  in  a 
hierarchy  of  partial  differential  equations  that  completely  determine  the  functions 
for  aU  k  ([BLP]). 

It  can  be  shown  that  the  effective  conductivity ^  Oe/f,  is  given  by^ 

((e  +  Vx(x)))‘  •  a(x)  •  (e  +  Vx(x))).  (6) 

^The  validity  of  this  assumption  will  be  discussed  below. 

^Strictly  speaking,  we  must  supplement  (3)  with  boundary  conditions  on  the  boundary  of  the 
conductor.  It  can  be  shown  that  the  form  of  the  homogenized  equation  is  independent  of  these 
conditions. 

^For  simplicity,  we  assume  that  the  composite  is  macroscopically  isotropic,  so  that  <7^//  is  a 
scalar.  In  general,  the  effective  conductivity  would  be  a  tensor  (^eff,ij- 
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where  e  is  a  unit  vector  etnd  the  "potential’  x(x)  is  a  periodic  solution  of  the  the 
PDE 

V,  •  (e  +  Vx(x))  =  0  (7) 

The  brackets  ( • )  in  (6)  represent  averaging  over  the  period  cell  of  <t(x).  Equation 
(7)  is  known  as  the  cell  problem  associated  with  the  homogenization  problem  (3). 
The  effective  conductivity  characterizes  the  conduction  properties  of  the  composite 
on  a  macroscopic  scale. 

Homogenization  theory  is  not  restricted  to  periodic  media.  Kozlov  [Ko]  and 
Papanicolaou  and  Vciradhan  [PV]  established  the  existence  of  a  homogenized  limit 
for  equation  (3)  for  very  general  stationary  ergodic  random  conductivities  £7(x)  as 
€  0,  using  the  ansatz  (5).  This  theory  requires  however  that  the  conductivity  be 

bounded  away  from  0  and  co,  i.e.,  that 

0  <  A  <  ct(x)  <  1/A  (8) 

for  some  constcint  A  G  (0,1],  uniformly  in  x.  In  this  more  general  setting,  the 
effective  conductivity  is  obtained  by  solving  (6)  in  a  suitable  Hilbert  space  and 
evaluating  (7).  (The  brackets  in  (7)  represent  then  statistical  averaging.) 

2.  Advection-difFusion.  Let  us  consider  a  different  but  related  physiccJ  problem: 
the  mixing  of  a  passive  scalar  in  an  incompressible,  steady  flow.  Typically,  this 
sccdcir  represents  the  volumetric  concentration  of  a  solute  substance  dispersing  in 
the  fluid  under  the  action  of  the  flow  and  of  molecular  diffusion® . 

The  equations  for  the  concentration  (^(x,  t)  and  the  velocity  ^(x)  are 

+  y(x)  •  VC(x,  t)  =  Z?  AC(x,  t)  (9a) 

and 

V  •  F(x)  =  0.  (9b) 

For  simplicity,  we  assume  that  the  velocity  field  V  is  smooth  and  periodic  in  and 
that  it  has  average  zero  over  the  period:  (  )  =0.  To  compute  the  effective,  large- 

scale  diffusion  rate,  we  make,  as  before,  a  change  of  variables  x'  =  cx,  f'  ~  t 
and  rewrite  equation  (9a)  cis 

+  -y(x7£)  VCe(x',  0  =  jDAC,(x',  t') 
ot’  e 

where 

Ce(x',«')  = 

In  the  case  of  periodic  velocity  fields,  the  homogenization  of  equation  (10)  is 
essentially  equivalent  to  the  problem  of  Section  1,  with  a  minor  modification.  To 


®For  a  background  on  this  problem  see,  for  instance,  Monin  and  Yaglom  [MY]. 
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see  this,  notice  that,  since  V  is  incompressible  and  has  mean  zero,  it  has  a  smooth, 
periodic  vector-potential  satisfying 

V  X  ^(x)  =  V^(x). 


Define  Ein  antisymmetric  matrix  Hij  (x)  by  setting 

Hi2  =  Az  ,  His  =  -As  ,  ff23  =  Ai ,  etc. 

Using  this  matrix,  the  rescaled  equation  (10)  can  be  rewritten  as  a  divergence-form 
parabolic  PDE  with  a  tensor-valued,  non-symmetric  “conductivity”.  Specifically, 
Ce  satisfies 


=  V  ■  [(DI  -h  H(x7€))  •  VCe(x',t')] 
dt' 

where  I  is  the  identity  matrix  and  H  =  Hi,.  Using  the  homogenization  method 
(i.e.  expansion  (5)),  it  can  be  shown  [BLP]  that  C^{x' ,t' )  converges  to  the  solution 
of  the  diffusion  equation 


at' 

with  effective  diffusivity,  D^ff,  given  by® 

D,//  =  D[l  +  (|Vx(x)n],  (11) 

where  x(x)  is  a  periodic  solution  of  the  “ceU-problem” 

V-  [(DI-|-H(x))-(Vx(x)  +  e)]  =0.  (12) 


Generalizing  this  result  to  the  setting  of  random  velocities  V  is  important  for 
physical  applications.  However,  the  adaptation  of  homogenization  methods  to 
advection-diffusion  in  random  velocity  fields  is  a  more  delicate  matter.  This  is 
due  to  the  fact  that  the  natural  stationary  field  is  the  velocity  V  and  not  the  vec¬ 
tor  potential  A.  To  apply  the  homogenization  method  with  random  velocities  one 
must  make  additional  hypotheses.  If  the  vector  potential  is  uniformly  bounded  and 
stationary  -  and  thus  a  coercivity  relation  of  type  (7)  holds  for  the  “conductivity” 
2)  I  q.  H  -  the  theory  of  Kozlov  2ind  Papcinicolaou  and  Varadhan  can  be  directly 
applied  to  show  that  the  tracer  obeys  a  macroscopic  diffusion  equation.  A  general¬ 
ization  of  this  result,  obtained  with  Andy  Majda  [AMI],  shows  that  the  scalar  will 
satisfy  an  effective  diffusion  equation  provided  that  the  velocity  admits  a  station¬ 
ary,  syuare-integrable  vector  potential.  (See  also  Oelschlager  [0]).  The  significance 
of  the  result  is  that  it  is,  in  a  sense,  sharp.  (The  condition  is  also  necessary.) 

6We  assume,  for  simpUcity,  that  the  system  is  macroscopically  isotropic  so  that  the  effective 
diffusivity  is  a  scalar. 
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A  random  vector  field  V  has  a  stationary  vector  potential  if  and  only  if 

R3 


where  R  (k)  is  the  spectrcd  measure  of  V, 

Intuitively,  if  condition  (13)  is  satisfied,  the  “correlation  length”  of  the  stationary 
vector  potential  A  determines  the  “microscopic”  scale  in  the  problem.  Homogeniza¬ 
tion  theory  then  tells  us  that  the  system  behaves  diffusively  at  distances  which  are 
large  compared  to  the  correlation  length. 

However,  the  assumption  of  stationarity  of  the  vector-potential  is  not  always 
suitable  from  a  physical  viewpoint.  I  will  show  that  new  phenomena  arise  if  this 
condition  (13)  fails,  i.e.  if  A  has  larger  and  larger  fluctuations  as  |x|  oo.  In 
the  latter  case,  the  correlation  length  of  the  vector  potential  is  infinite  and,  as  we 
shall  see,  the  macroscopic  motion  is  no  longer  diffusive.  This  puts  the  problem 
beyond  the  reach  of  “classical”  homogenization  theory  and  new  methods  must  be 
introduced. 

3.  Iterated  Homogenization  and  Rigorous  Decimation  Procedures.  Ho¬ 
mogenization  theory  can  be  applied  to  study  multiple- scale  heterogeneous  media 
and  flows.  This  is  cin  important  application  of  the  theory,  which  I  regard  as  a  “pre¬ 
cursor”  of  the  physicists’  renormalization  theory  discussed  in  §7.  Many  situations 
in  Physics  lead  to  the  consideration  of  self-similar  or  fractal  media.  Thee  include 
the  analysis  of  phase  transitions  and  of  tmbulent  mixing  in  flows  with  mciny  active 
energetic  scales. 

Suppose  that  the  microscopic  conductivity  of  a  material  can  be  expressed  in  the 
form 

ae(x)  =a(xe,xeL...,X£^),  (14) 

where  e  is  a  small  parameter  and  the  function  a  is  defined  in  (R^)^  cind  is  periodic 
of  period  1  in  each  variable.  A  “medium”  with  this  conductivity  is  composed  of 
“elements”  of  different  characteristic  lengths,  each  being  much  larger  or  smaller 
than  the  other.  Homogenization  theory  tells  us  (cf.  [BLP])  that  the  solution  of 
the  cell  problem  associated  to  (I4)  can  be  done  recursively^  by  solving  independent 
partial  differential  equations  at  each  scale.  This  is  done  as  if  xe^  ,X€^  , ...  ,x 
were  independent  variables.  To  illustrate  this  idea,  let 

cr,(x)  =S(x€,xe2).  (15) 

represent  the  conductivity  of  a  “two- scale”  material.  Accordingly,  in  order  to  cal¬ 
culate  its  effective  conductivity  as  e  ->  0,  we  should  first  homogenize  the  shorter 
scale  X€,  treating  the  longer  one,  xe^,  as  a  parameter.  The  result  is  a  “partially 
homogenized”  conductivity 

<t(^)(x)  =  (T*  (xe^), 
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where  a*  is  obtained  by  solving  a  cell  problem  of  type  (6),  with  x€  as  the  in¬ 
dependent  variable  and  in  held  fixed,  and  averaging  over  the  corresponding 
period  cell.  In  a  second  stage,  we  should  homogenize  crP(x),  using  again  a  cell 
problem.  The  result  of  these  two  successive  homogenizations  is  exactly  the  effective 
conductivity  of  the  two-scale  material  (15)  as  e  ^  0. 

As  an  application  of  this  idea,  consider  now  a  self- similar  material  with  conduc¬ 
tivity  (14)*  By  this,  I  mean  a  material  for  which  the  spatial  fluctuations  in  the 
conductivity  are  identical  from  one  scale  to  the  next  after  dilatation  by  a  factor  of 
e.  Using  iterated  homogenization,  we  see  that  the  effective  conductivity  of  such  a 
material  can  be  obtained  (in  the  Umit  e  ->  0)  by  iterating  a  map:  i.e., 

=  nofto  ...n(<7o) 

=  nW(ao),  (16) 

where  ao  is  an  initially  given  conductivity  and  a  i-4  ^((t)  is  the  result  of  a  “single- 
step”  homogenization.  This  operation  is  identical  at  every  scale  due  to  the  self- 
similarity.  When  the  map  H  is  known,  we  can  calculate  the  effective  conductivity 
by  doing  successive  iterations.  If  the  number  of  scales  N  is  large  or  infinite,  the 
problem  reduces  to  the  analysis  of  the  fixed  points  of  Q,  ([Mi],  [A]). 

It  is  worthwhile  to  emphasize  that  the  principle  of  iterated  homogenization  hinges 
on  the  large  separation  of  scales  imposed  in  the  microstructure  by  assuming  that  e 
is  a  small  parameter.  This  assumption  leads  to  nice  mathematical  results  but  may 
not  always  be  justified  in  practice. 

This  idea  can  also  be  used  to  evaluate  the  effective  diffusivity  for  advection- 
diffusion  with  velocities  with  multiple  scales.  For  instance,  consider  a  velocity  field 
of  the  form  ^ 

K(x)  =  X;/5"€’*Vo(x€"),  (17) 

n=l 

where  Vq  is  a  given  incompressible  periodic  vector  field  and  e  and  ^  G  (0,1)  are 
parameters.  The  effective  diffusivity  for  (17)  can  be  calculated  exactly  in  the  limit 
£  ->  0  as  follows.  First,  we  observe  that  the  corresponding  Ff-matrix  is  given  by 

JV 

H(x)  =  53 /^"Ho(x€"),  (18) 

n=l 

where  Ho  is  the  H-matrix  for  the  basic  field  Vq.  The  smallest  spatial  scale  corre¬ 
sponds  to  the  term  with  n  =  1.  Iterated  homogenization  allows  us  to  “average  out” 
the  combined  effects  of  molecular  diffusion  and  this  first  advective  scale  indepen¬ 
dently  of  the  larger  scales  (n  >  2).  Let  us  define  the  map 

F{D)  =  homogenized  diffusivity  for  the  operator  Z?  A  +  Vo  '  V . 

The  effect  of  “decimating”  the  smallest  spatial  scale  is  a  modified  diffusion  coeffi¬ 
cient  Di  given  by 

Di 
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We  can  continue  “decimating”  the  successive  scales  n  =  2,3..,.  A  simple  cal¬ 
culation  shows  that  the  diffusivity  Dj,  obtained  after  eliminating  the  scale, 
satisfies 

Dj=l3^F(^).  (19) 

Supplementing  this  relation  with  the  initicil  condition  Dq  =  D,  we  obtain  a  recur¬ 
sion  relation  from  which  the  effective  diffusivity  can  be  expressed  in  terms  of 
D  and  the  “one-step  map”  F  (the  analogue  of  ft  for  advection-diffusion). 

Consider  now  what  happens  if  ?/  =  oo  in  (17).  It  is  easy  to  show  that,  for 
/3  <  1,  the  diflFusion  coefficient  Dn  approaches  a  finite  fimit  as  N  ->  oo.  On  the 
other  hand,  if  /?  >  1,  the  velocity  is  well-defined  for  each  value  of  6  <  1//?,  but 
Djv  diverges  as  N  ^  oo.  For  instance,  if  /3  =  1,  the  iteration  scheme  (19)  is 

Dn  =  F  o  F  o  ...  o  F{D) 

=  (D)  . 

Reccdl,  from  (10),  that  the  homogenized  diffusivity  satisfies  De/f  =  F{D)  >  D 
for  all  D.  Therefore,  the  map  F  increases  the  diffusivity  with  each  step.  Thus, 
Dn  increases  without  bound  as  N  oo.  This  is  an  indication  that  the  effective 
diffusion  coefficient  corresponding  to  advection-diffusion  by  the  velocity 

oo 

e"Vb(xe") 

1 

is  infinite^ .  It  is  noteworthy  that  the  velocity  field  (17)  has  a  uniformly  bounded 
vector  potential  if  /?  <  1,  for  all  €  >  0  and  all  N.  On  the  other  hand,  if  /3  >  1, 
the  vector  potential  of  (17)  diverges  as  N  ^  oo. 

4.  Anomalous  Diffusion  and  the  Renormalization  Problem.  Advection- 
diffusion  in  flows  for  which  the  Icirge-scale  behavior  of  the  passive  scalar  is  su- 
perdiffusive,  i.e.  for  which  the  effective  diffusion  coefficient  is  infinite,  have  been 
the  object  of  much  investigation.  Recent  progress  has  been  made  in  understand¬ 
ing  this  phenomenon  from  a  rigorous  standpoint,  although  a  complete  theory  is 
still  missing.  Anomalous  diffusion  in  incompressible  flows  can  arise  only  when  the 
vector-potential  is  non-stationary,  i.e.  when 

f  ^(k)  ,3, 

J  I  k  P  ^  ^  ~  ^ 

The  large-scale  analysis  of  the  advection-diffusion  equation  (9a)  for  velocities 
satisfying  (20)  can  be  formulated  as  follows.  First,  let  J  >  0  be  a  small  parameter 

^It  is  an  “indication”  rather  than  a  proof,  because  I  took  the  limit  e  0  first  and  then  the 
limit  N  — >  oo,  rather  than  the  other  way  around.  The  same  conclusion  and  caveat  apply  for 
0  >  1. 
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representing  the  ratio  between  the  velocity  correlation  length  and  the  macroscopic 
scale.  We  make  the  change  of  coordinates  x'  =  Jx ,  so  that  x’  represents  distance 
measured  in  the  macroscopic  scale.  Let  us  also  define  a  macroscopic  dme-scale 
by  setting  t'  =  p^{5)  t ,  where  p^{5)  is  a  yet  unspecified  function  of  S.^ 

Unhke  the  cases  treated  before,  the  macroscopic  time-scale  is  expected  to  be  dif¬ 
ferent  than  in  homogenization  theory,  to  account  for  the  infinite  diffusion  coefficient 
when  p  =  S.  We  shall  see  that  this  time-scale  depends  on  the  singularity  of  the 

function  at  |k|  =0. 

Define  the  large-scale  concentration  field 

Ci(x'0  =  (21) 

The  renormalization  problem  ([AM2])  consists  in  i)  finding  a  scaling  function  p(5) 
such  that 

lim  C5(x',t')  =  C'o(x',t')  (22) 

(5-^0 

defines  a  non-trivial  function,  different  both  from  zero  and  from  the  initial  datum, 
cind  ii)  to  chciracterize  the  hmiting  function  Co(x'  ,t'),  by  means  of  an  an  appro- 
priate  evolution  equation. 

An  important  difiference  between  systems  with  stationary  vector  potentials,  sat¬ 
isfying  (13),  and  the  present  ones  is  that,  in  the  former,  the  hmiting  function 
Co(x',t')  is  always  non-random,  due  to  the  “self-averaging”  property  of  vector- 
potentials  with  finite  correlation  length  (PV).  In  contrast,  for  infrared-divergent 
flows  satisfying  (20),  the  limiting  function  Co  is  expected  to  be  a  random  process, 
with  a  distribution  determined  by  a  macroscopic  equation  of  motion. 

5.  Simple  Shear  Flows  with  Infrared  Divergences.  The  simplest  mathemat¬ 
ical  model  to  study  anomalous  diffusion  is  a  simple  shear  flow  [AMS] 

y(xi  ,X2)  =  {U(X2),0)  , 

where 

U{x2)  -  U  j  coskx2N{k)dk.  (23) 

\k\<ko 

The  function  N{k)  in  the  integrand  represents  one-dimensional  Gaussian  white 
noise.  Thus,  U{x2)  is  a  Gaussian  field  with  spectral  function 

I  0,  1A:|  >  fco- 

»The  parameter  5  plays  the  same  role  as  e  in  Sections  1  and  2;  e  will  be  used  to  denote  another 
quantity  (cf.  (23)  ). 
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The  integral  J  R(k)/\k\^  dk  converges  if  c  <  0  and  diverges  for  e  >  0.  The  param¬ 
eter  e  varies  in  the  range  —  oo  <  e  <  2.^  It  controls  the  degree  of  divergence  of 
the  integral  (20)  since  ,  for  e  >  0, 


J  \k\^-y\k\Uk  ^ 


<S<l*l<*o 

and  the  stream-function,  il>{x2),  (the  2D  analogue  of  the  vector  potential)  for  this 
flow  satisfies 

Tp{x2)  «  1x21  »  1. 

Let  us  derive  the  macroscopic  evolution  equation  for  this  system.  Due  to  the 
anisotropy  of  the  random  shear  flow,  it  is  convenient  to  define  Cs  in  a  slightly 
different  way  than  (21),  namely, 

C,(xi,x',t')  =  S-^p-^  C{x[/5,xyp,t'lp^). 

Using  equation  (9a),  we  find  that  the  PDE  satisfied  by  C5  is 

'  d{x[)^  d{x'^f  J’ 

A  choice  must  now  be  made  for  the  scaling  function  p{S).  We  choose 

p(5)  =  j2/(2+«)  ^25) 

for  reasons  that  will  become  immediately  clear.  Using  this  value  of  p,  we  can  rewrite 
equation  (24)  as 

dC^,x',  ,t')  rr,  .  dCs{x[ , x'  , ) 

D  f  ^  1^2  )  ,  d'^Ci{x\  ,X2  ,t')  , 

d{x\Y  _  J-  (26) 

We  observe  that  the  random  field 

p‘l^-^U{x'2lp) 

converges  in  distribution  as  p  0  to  the  Gaussian  field 

+00 

Uo{x'2)  =  U  y  |fc|<*-‘)/"cosfcx'Ar(/c)dfc. 


^The  upper  bound  e  <  2  is  imposed  so  as  to  have  a  velocity  with  finite  “energy”  (  |U(  x  )p  )  . 
In  [AM  3],  a  wider  range  0  <  c  <  4  is  studied  by  introducing  a  natural  infrared  cutoff  for  the 
velocity  statistics. 
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This  is  a  generalized  (distribution-valued)  random  field,  in  the  sense  of  Gel’fand 
and  Vilenkin  [GV].  Therefore,  formally, 

lim  Cs  ^X2  )  =  Cs  yX2  it  ) , 

5-vO 

where  Co  satisfies  the  stochastic  PDE 

dCoix[,x',,n  ,  dCoix[,x',,t')  ^  d^Co{x[,xyn  (27) 

- -  +  t^o(*2)  9^-  ^  a(x^)2 

The  recuier  may  wonder  what  is  the  meaning  of  a  partial  differential  equation  in 
which  one  of  the  coefficients  is  a  random  distributionl  Nevertheless,  it  c^  be  proved 
rigorously  that,  in  spite  of  the  presence  of  the  singular  coefficient  tfo(a:2)-  equation 
(27)  has  a  weU-defined  solution  which  can  be  expressed  as  an  expectation  over  a 
Wiener  paths  and,  moreover,  that  this  equation  characterizes  the  limit  of  Cs  as 
J  ->  0.  The  solvability  of  (27)  justifies  the  choice  of  the  scaUng  function  and  is 
reaUy  the  essence  of  the  renormalization.  The  key  element  needed  to  solve  (27)  is 
the  fact  that 


J  UoiPis)) 


is  a  well-defined  random  variable  with  finite  moments. 

As  an  application  of  this  result,  let  us  calculate  the  averaged  Green  function, 
which  describes  the  evolution  of  the  statistical  mean  of  the  random  function  Co- 
This  is  done  by  solving  (27)  and  averaging  over  velocity  statistics^®.  Accordingly, 
we  find  that 

{Co{x[,x'2,t'))  =  J  J  P{x[-yi,x'2-y2,t’)Cinit(yi,yi)dyidy2, 

where  Cinit  represents  the  initial  condition  and  where  the  Fourier  transform  of 
P  (x'j  ,  Xj  .  *' )  is  gi’^®*' 

P(ki,k2,t')  =  exp{-D(k2ft'}x 


E { exp  [ 


1/2  ((')(i+'/2)  (fci) 


2(2-€/2)  £)( 


'1^  J  J  F,(/3(s)  -  I3{s'))dsds'  ]  } 


In  this  last  equation,  the  expectation  is  taken  with  respect  to  Brownian  Motion, 
P{s)  ,0  <  s  <  1,  and  the  function  Fc  which  appears  in  the  exponent  is  given  by 


+  00 

.(y)  =  /  \k\^-^e^'‘^dk 


fore/1 

Cl  S{y)  fore  =  1 , 


^The  higher-order  moments  of  Co  can  be  computed  in  a  similar  fashion  . 
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where  and  Ci  are  numerical  constants  and  5{y)  is  the  Dirac  function.  The 
function  (28)  is  very  far  from  being  a  Gaussian  propagator  exp  (—  Dg//  Ikp  t' ).  It 
has  the  form 

P{ki,k2,t')  =  $[Z»-(i-'/2)(fci)2(t')i+'/2]  .  exp{-D{k2ft'}. 

This  Green  function  does  not  correspond  to  a  “closed”  evolution  equation  for  the 
average  scalar.  (On  the  other  hand,  the  unaveraged  scalar  follows  a  stochastic 
PDE.) 

Recently,  Majda  obtained  a  complete  characterization  of  the  distribution  of  the 
concentration  in  the  case  of  time- dependent  t^dom  shear  flows  ([Mai]). 

6.  “Nearly  Stratified”  Flows;  Rigorous  Assessment  of  the  Infiuence  of 
High- Wavenumber  Components  on  Macroscopic  Scales.  The  previous  re¬ 
sult  can  be  extended  to  a  wider  class  of  flows  with  infrared  divergences.  These 
“nearly  stratified”  flows  are  no  longer  simple  shear  flows,  but  have  the  same  type  of 
spectrum  for  |k|  -C  1.  Since  this  is  the  spectral  region  which  “controls”  the  diflFu- 
sive/superdiffusive  behavior,  we  expect  qualitatively  the  same  results  as  for  the  sim¬ 
ple  shear  flows^^.  However,  unlike  in  the  previous  example,  the  high-wavenumber 
components  of  the  flow  also  “participate”  in  the  renormalization  [AM4]. 

The  nearly  stratified  flows  have  the  form 

V"(xi  ,X2  )  =  {U{X2)  +  Ui  (xi  ,X2  )  ,U2{xi,X2))  (29) 

where  U  is  defined  in  (23)  and  U  =  {Ui  ,C/2)  is  an  incompressible  vector  field 
which  satsifies  condition  (13).  Notice  that  the  associated  Lagrangian  trajectories 
are  non-trivial  and  the  advection-diflFusion  equation  cannot  be  solved  by  separa¬ 
tion  of  variables.  My  goal  in  this  section  is  to  point  out  that  the  effect  of  the 
high- wavenumber  component  t/'(x)  on  the  on  the  macroscopic  scale  is  simply  to 
renormalize  the  transverse  diffusion  coefficient 

This  can  be  seen  intuitively  by  considering  the  Lagrangian  formulation  of  the 
problem  [AM4],  A  Lagrangian  particle  moving  with  the  flow  (29),  with  molecular 
diffusivity  D,  will  perform  a  normal  random  walk  in  the  a:2-direction  at  large  scales, 
and  behave  superdiffusively  in  the  xi-direction.  These  two  phenomena  occur  at 
different  time  scales.  In  fact,  the  diffusive  motion  is  achieved  at  distances  which  are 
large  compared  to  the  (finite)  correlation  length  of  U2  {xi  ,0:2  ),  but  still  relatively 
short.  Because  of  this,  the  superdiffiisive  behavior  in  the  a;  1 -direction,  which  is 
determined  by  the  infinitesimal  region  of  the  spectrum  near  |k|  =  0  and  hence 
by  very  large  wavelength  fiuctuations,  is  like  in  Section  5.  The  system  behaves  as 
if  the  Lagrangian  particle  component  X2(t),  was  distributed  as  Brownian  Motion 
independent  of  U(x2)y  with  an  enhanced  diffusivity.  The  macroscopic  time-scale 
p{S)  which  characterizes  superdiffusion  in  the  xi  direction,  is  imchanged. 

“In  the  language  of  the  renormalization  group  theory,  the  high  wavenumber  components  are 
“irrelevant”  insofar  as  the  critical  exponents  are  concerned. 
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The  macroscopic  evolution  equation  for  the  function  Cq  for  “nearly  stratified” 
flows  turns  out  to  be 


d{xi,)^ 


where  D^,.  is  a  homogenized  transverse  diffusivity  (see  [AM4].  Thus,  the  system 
behaves  m  if  there  was  a  “separation  of  scales”  between  the  “infrared- divergent” 
component  {U{x2),Q)  and  the  “infrared-convergent”  component,  (t/i(xi,X2). 
1/2  (xi  ,X2  ))•  The  situation  is  actually  more  subtle  and  shows  a  notable  difference 
between  iterated  homogenization  and  renormalization.  In  fact,  depends  on 

the  random  velocity  Cf(xi)  as  well  Thus,  equation  (30)  does  not  arise  from  an 
iterated  homogenization  as  presented  in  §3  —  the  “longer”  scales  are  still  weakly 
coupled  to  the  shorter  ones  and  play  a  role  in  determining  the  effective  diffusmty. 

To  be  more  specific  about  this  last  point,  I  exhibit  this  interaction  explicitly.  It 
is  proved  in  [AM4]  that  the  transverse  effective  diffusivity  in  (30)  is  obtained  by 
solving  the  “cell  problem” 


dx{^iy^2 


and  that 


=  -^2(3:1, X2), 

Di-,,=D  -  [l+dVxH]- 


Notice  that  U{x2  )  is  present  in  equation  (31). 

The  renormalization  for  nearly-stratified  flows  has  been  instrumental  for  ana¬ 
lyzing  more  complicated  systems.  For  instance,  AveUaneda,  Elliott  and  Apelian 
([AEA])  used  it  to  explain  the  scaling  behavior  of  random  advection  with  zero 
diffusion  in  the  presence  of  a  mean  flow  (i.e.  (  F  )  ^  0)- 

7.  Renormalization  Group  Analysis:  a  Non-Rigorous  Decimation  Pro¬ 
cedure.  In  this  section,  I  return  to  the  renormalization  problem  formulated  in  §4 
for  Motropic  infrared-divergent  velocities.  The  degree  of  infrared  divergence  can  be 
quantified  using  the  exponent  e,  as  in  the  shear-flow  models.  We  shall  consider  an 
incompressible,  isotropic,  Gaussian,  d-dimensional  random  velocity  field  y(x)  with 

spectrum  ^(k)  =  ,  |k|  <  fco ,  (32) 

where  the  parameter  range  is  0  <  e  <  2. 

It  is  easy  to  see  that  for  any  number  0  <  0  <  1,  such  a  velocity  can  be  repre¬ 
sented  cis  sm  infinite  series 


F(x)  =^r(»-')F„(x^") 


(33) 
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where  the  Vn  are  independent,  identically  distributed  Gaussian  velocities  with  the 
same  band-hmited  spectrum 

k )  =  y  ikp-''-') ,  0  ko  <  |k|  <  ko . 

Notice  that  the  velocity  (33)  is  analogous  to  the  ones  studied  in  §3.  Because  the  sin¬ 
gularity  of  the  ratio  R{k\  )/|kp  is  the  key  feature  in  this  problem,  we  are  naturally 
led  to  considering  multi-scale,  self-similar  flows.  However,  an  important  difference 
between  this  problem  and  iterated  homogenization  theory  is  the  absence  of  “sepa¬ 
ration  of  scales”.  Notice  cdso  that,  since  6  <  1 ,  the  H-matrix  associated  with  the 
velocity  in  (33)  is  a  divergent  series 

oo 

H(x)  H„(x^"),  (34) 

0 

where  is  the  H-matrix  of  Vn. 

The  renormalization  group  analysis  ([FNS],  [R],  [AN])  is  a  non-rigorous  approach 
to  study  this  problem,  which  is  based  on  successive  decimations.  The  idea  is  to  guess 
the  scaling  exponent  (or,  equivalently,  p(5))  by  analyzing  recursively  the  effects  of 
successive  scales  on  the  evolution  of  Cq.  I  shall  focus  on  the  renormalization  group 
analysis  of  the  effective  diffusivity  of  the  “truncated”  velocity 

N 

0 

This  coefficient  can  be  viewed  (in  the  context  of  the  renormalization  group  analy¬ 
sis)  as  describing  the  effect  of  the  top  N  scales  on  the  macroscopic  concentration 
field.  Renormalization  group  theory  makes  the  hypothesis  that  each  scale  influences 
the  longer  ones  primarily  as  a  “homogenized”  coefficient,  i.  e.  through  a  map. 
This  should  be  understood  as  an  “approximate  homogenization”,  because  i)  6  is 
not  an  infinitesimal  quantity  and  ii)  we  have  shown  in  the  previous  section  that 
the  renormalized  diffusivity  corresponding  to  high-wavenumber  components  is  not 
independent  of  the  larger  scales  (n  >  N).  Such  hypothesis  leads  to  a  recurrence 
relation  for  the  successive  diffusivities  Di,D2, etc.  Let  us  make  this  relation 
explicit  by  introducing  the  map 

F(D)  =  “homogenized”  diffusivity  of  the  operator  DA  -f-  Vi  •  V.  (36) 
Like  in  (19),  the  recurrence  relation  has  the  form 

Dn  =  0  <  n  <  TV  (37a) 

or, 

A„  =  Dn  r ^ ,  \n  =  F[  r  A„_1 ) .  (37b) 

To  analyze  this  relation,  we  use  the  fact  that  F{D)  is  a  monotone  increasing  function 
satisfying  iF(D)  /  Z?  >  1  and  that  F{^D)  J  D  1  as  £)  — >  oo,  as  well  as  the  fact 
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that  0  <  1.  It  foUows  that  the  function  X  F(0"A)  has  a  unique  fixed  point, 
A*.  We  conclude  from  {37b)  that 


Dn  a  A*r'^ 


N  >  1. 


This  gives  an  estimate  of  the  contribution  of  the  high-wavenumber  components 
( Ikl  >  koO’^)  to  the  diflrusivity,  suggesting  that  the  effective  diffusmty  is  infimte 
as  W  00  and  diverges  in  a  precise  way  as  the  infrared  cutoff  (35)  is  removed^ 
RecaUing  that  diffusivity  has  dimensions  of  P  jT  and  that  9^  k,  h^  imensions  of 
L,  (38)  suggests  the  space-time  scaling  for  the  macroscopic  concentration 

^2  ^  yi/(i-€/4)^  r>l,  (39a) 

or  in  terms  of  the  time-scahng  function  p{5)  of  the  renormalization  problem. 


These  scaling  relations  are  in  exceUent  agreement  with  Monte-Carlo  simulations 
using  Lagrangian  particles  [AG]. 

8  An  Approximate  Evolution  Equation  for  Co  in  the  case  of  Isotropic, 

Infrared-Divergent  Velocity  Fields.  The  renormalization  probkm  for  isotropic 

velocity  fields  with  spectra  (32)  is  still  a  mathematically  open 

have  used  renormalization  group  analysis  as  well  as  diagrammatic  methods,  or 
renormalized  perturbation  theories  ([Kr],  [KB]),  to  derive  approximate  the  evo¬ 
lution  equations  for  the  mean  statistics  (  Co(x'  ,t') )  •  These  procedures  are  ^ 

consistent  with  the  scaling  (39b)  but  differ  in  the  form  of  the  effective  equations  . 

I  shall  speculate  on  the  form  of  the  evolution  equation  for  the  unaveraj/ed  concen¬ 
tration,  assuming  that  (39b)  is  the  correct  scaling. 

The  PDE  satisfied  by  the  function  C«(x'  ,t')  defined  in  (21)  is 

dCsjx.'  ,t')  .  VC«(x',t')  =  (SIp)^  D  ACsix.'  ,t'). 

dt' 

Substituting  the  scaling  function  p{S)  from  (39b),  we  obtain  the  PDE 

dCs{x',t')  C'i(x' ,  f')  =  5'^"  D  A  Csix!  ,t') .  (40) 

dt' 

This  leads  to  the  following  observations:  first,  the  random  field 

^e/2-l  y(J-l  x' )  (41) 

l^Maida  fMa2]  obtained  exact  solutions  for  the  first-order  moment  of  Co 
velocities  with  f  Jt  time-decorrelation.  This  is  a  different  problem  that  the  one  for  steady 

tid  Maida  IAM5]  put  to  test  the  vididity  ^ 

ries  and  of  the  renormalization  group  by  comparing  then  predictions  in  the  context  exactly 

renormalizable  models. 
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converges  in  distribution  to  a  Gaussian  random  field  Vo{x')  with  spectriun 


J^(k)  = 


k  6  R". 


Note  that  since  0  <  c  <  2,  the  function  -R(k)  is  not  integrable  and  hence  Vo(x^) 
is  a  distribution-valued  random  field.  Second,  D  ACs{x.\t')  converges  weakly 
to  zero  in  the  Hilbert  space  (R"' ,  <  •  > ) ).  We  are  thus  led  to  the  formal 

conclusion  that  the  large-scale  concentration  should  satisfy  the  first-order  stochastic 
PDE 

+  yo(x' )  •  V  Co(x' ,  t')  =  0 .  (42) 

However,  this  reasoning  leads  to  serious  difficulties:  first,  due  to  the  absence  of  any 
smoothing  in  the  right-hand  side  of  (42),  the  mecining  of  advection  by  a  distribution¬ 
valued  random  function  is  unclear^'^.  The  vector  field  Vq  is  far  from  satisfying 
the  usual  requirements  for  well-posedness  of  the  initial- value  problem  (classically, 
Lipschitz  continuity  is  required).  Thus,  equation  (42)  is  essentially  meaningless^®. 
Furthermore,  the  passage  to  the  hmit  in  (40)  requires  evaluating  the  fimit  of  the 
product  of  the  singular  field  with  V  Cs{x'  The  regularity  of 

the  gradient  of  the  solution  is  not  expected  to  be  sufficient  to  justify  the  above 
procedure.  A  more  sophisticated  renormalization  is  needed  to  make  sense  of  the 
fimit  of  (40)  as  (5  ->  0.  Since  numerical  experiments  leave  little  doubts  as  to  the 
validity  of  the  scaling  (39b),  I  will  use  heuristic  ideas  to  guess  the  form  of  the 
effective  evolution  equation. 

As  indicated  by  the  examples  of  Sections  5  and  6,  the  large- sccile  evolution  is 
determined  by  dissipation  from  the  high- wavenumber  components  (with  |k|  > 
constant)  in  conjunction  with  the  advection-dominated  transport  from  the  infrared 
portion  of  the  spectrum  (|k|  1).  The  renormalization  group  analysis  suggests 

that  the  diffusivity  “contributed”  by  Fourier  modes  with  |k|  >  /c  should  be 

D{  k)  a 


Naively,  we  could  implement  this  idea  by  introducing  a  fractional  Laplacian^®, 
_|k|2-e/2  _  right-hand  side  of  (42).  The  corresponding 

evolution  equation  would  then  be 


^(7o(x'  ,t') 


+  Vb(x').  VCo(x',0  =  -7(M(-A)'"'^Vo(x^O 


where  7(  fco)  is  a  non-uni  versa!  constant  depending  on  the  high- wavenumber  cutoff 
ko  in  (32).  Although  equation  (43)  is  consistent  with  the  symmetry  and  scaling 
of  the  problem  (it  is  invariant  under  the  scaling  transformation  associated  with 


Herein  lies  the  difference  between  the  renormalization  for  isotropic  velocities  and  the  exam¬ 
ples  of  Sections  5  and  6. 

^^This  applies  to  e  is  the  range  0  <  c  <  2.  In  Avellaneda  and  Majda  [AM2],  it  is  shown  that 
(42)  is  the  correct  renormalization  for  the  regime  2  <  c  <  4,  which  I  don’t  discuss  here. 

^®The  fractional  Laplacian  is  the  infinitesimal  generator  of  the  so-called  Levy  processes  in 
Probability  Theory. 
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it  presents  a  serious  drawback.  In  fact,  it  can  be  shown  that  (43) 
leads  to  infinite  second-order  moments  for  Lagrangian  mean-square  displacements 
due  to  the  “fat  tails”  of  the  Levy  distribution:  under  (43),  we  have 

J  |xf  (Co(x',t'))d‘'x'  =  00. 

This  divergence  is  inconsistent  with  numerical  simulations.  (Some  authors  have 
proposed  heuristic  renormahzations  associated  with  “Levy  flights  corresponding 
to  modified  Levy  operators  [SK]). 

An  approximation  that  does  not  lead  to  this  last  problem  can  be  obtained  by 
making  an  smalogy  with  the  effective  equation  for  nearly  stratified  flows  (30).  Ac¬ 
cordingly,  consider  the  equation 


^go(^''y''^'.)+y„(y-).V,,Co(x>.y'.t')  =  - 7( M  ( -Ay  Co(x' .y' ,<') 


with  initial  condition 


Co(x',y',t'  =  0)  =  Ciniti^')  x'€R^y'eR^. 


In  this  equation,  I  use  an  additional  variable,  y’,  to  model  the  action  of  the  high- 
wavenumber  components  on  the  large-scale  concentration.  This  represents  a  weak 
coupling”  approximation  for  the  interaction  between  advection  and  dissipation  at 
large  scales.  The  “macroscopic  concentration”  is  defined  as 

Co(x',t')  -  Co(x',0,i').  (45) 


Equation  (44)  mimics  the  equation  for  the  concentration  in  nearly- stratified  flows 
(30),  with  the  effective  transverse  diffusivity  replaced  by  the  operator 

-7(A:o)|ky|-"/^  It  is  not  hard  to  show  that  this  approximation  produces  finite 
mean-square  displacements  for  Lagrangian  particles.  I  do  not  claim  that  equa¬ 
tions  (44)  and  (45)  represent  an  exact  solution.  They  can  be  viewed  as  a  “closure 
for  calculating  the  statistics  of  the  large-scale  concetration.  As  other  approximate 
solutions,  this  one  has  only  heuristic  value.  For  instance,  in  this  approximation, 
the  Fourier  transform  of  the  averaged  Green  function  is 


11 

P{k,t')  =E{exp[  j  J  F,{Y{s)  -Y(s'))dsds'  ]  }, 

where  E  represents  expectation  value  over  over  the  d-dimensional  Levy  process 
Y(s)  and  (y' )  is  a  homogeneous  function.  The  higher-order  moments  of  Co  ( x' ,  t' ) 
can  be  calculated  easily  as  well.  The  implications  if  this  approximation  will  be  in¬ 
vestigated  elsewhere. 

In  conclusion,  I  have  attempted  to  show  the  relationship  between  homogeniza¬ 
tion  theory,  a  rigorous  method  for  averaging  partial  differential  equations,  and  the 
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renormalization  group,  a  heuristic  method  designed  to  investigate  scaling  behavior 
in  self-similar  systems.  There  are  important  connections  between  these  two  ap¬ 
proaches.  However,  the  assumption  of  scale  separation  invoked  in  homogenization 
is  not  adequate  for  treating  the  most  genersd  problems  of  transport  and  diffusion 
in  self-similar  random  media.  The  problem  of  advection-diffusion  in  a  random  ve¬ 
locity  field  showcases  the  difficulties  which  arise  when  the  velocity  has  self-similar 
characteristics  and  a  singulcir  spectrum.  Approximate  effective  equations  based  on 
exact  solutions  for  model  problems  can  be  derived.  However,  the  systematic  im- 
derstanding  of  anomalous  scaling  phenomena  remains  an  open  problem  in  Applied 
Mathematics, 
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